ANTISENSE OLIGONUCLEOTIDE WHICH INHIBITS EXPRESSION OF THE
OB-RGRP PROTEIN AND METHOD FOR DETECTING COMPOUNDS WHICH
MODIFY THE INTERACTION BETWEEN PROTEINS OF THE OB-RGRP
FAMILY AND THE LEPTIN RECEPTOR

The present application relates to antisense oligonucleotides which inhibit 
expression of the OB-RGRP protein and to uses thereof for preventing and/or 
treating leptin-related pathological conditions. 

It also relates to a method for detecting leptin receptor ligands using the 
energy transfer between, firstly, fusion proteins composed of leptin receptors 
and of energy-donor or -acceptor proteins and, secondly, fusion proteins 
composed of OB-RGRP or of MY047 and of energy-donor or -acceptor 
proteins. 

It also relates to fusion proteins for implementing this method.

Leptin is a 16 kDa protein secreted mainly by the adipose tissue, which binds
to a receptor (OB-R) belonging to the cytokine receptor family. Five 
membrane-bound isoforms of this receptor have been identified, and derive 
from alternative splicing of the same gene. These isoforms which have the 
same extracellular and transmembrane domain are characterized by 
intracellular domains of varying sizes (Tartaglia et al (1995) Cell 83, 1263- 
1271). A soluble form of the receptor has also been identified and comes from 
an alternative splicing or a proteolytic cleavage of the extracellular domain of 
the membrane-bound forms. The short form of the receptor (OB-Rs), which 
appears to be involved in transporting leptin across the blood-brain barrier, is 
the most expressed isoform. The long form (OB-RI) is only expressed in a few 
tissues, such as the hypothalamus, and appears to be responsible for most of
the biological effects of leptin (Sweeney, G. (2002) Cell Signal 14, 655-663). 
Leptin and its receptor have been the subject of particular attention due to 
their involvement in the regulation of energy balance and of the metabolism, 
and in the neuroendocrine response to food intake. Recently, it has been 
shown that leptin is also involved in important additional functions, such as
regulation of the bone mass, angiogenesis, thrombus formation, sexual 
maturation, hematopoiesis, the regulation of immunity and inflammation, fetal 
development and cancer. The administration of leptin to leptin-deficient 
organisms such as mice (ob/ob) and certain humans causes a decrease in 
the lipid mass in various tissues, such as the liver and the adipose tissue 
(Halaas et al. (1995) Science 269, 543-546, Pelleymounter et al. (1995) 
Science 269, 540-543, Campfield et al. (1995) Science 269, 546-549, Farooqi 
et al. (1999) N Engl J Med 341, 879-884). This treatment with leptin also 
improves the sensitivity to insulin and decreases the fatty mass in mice and 
humans exhibiting lipodystrophy (Shimomura et al. (1999) Nature 401, 73-76,
Oral et al. (2002) New England Journal of Medicine 346, 570-578, Petersen et
al. (2002) J Clin Invest 109, 1345-1350). Obese individuals are generally
resistant to leptin. The reasons for this resistance are still poorly understood, 
but several mechanisms have been suggested: a deficiency in leptin transport 
across the blood-brain barrier, a deficiency in activation of OB-R or in the 
signaling by these receptors, and the overexpression of negative regulators 
such as SOCS3 and PTP-1B (Bjorbaek et al. (2000) J Biol Chem 275, 40649- 
40657, Cheng et al. (2002) Developmental Cell 2, 497-503, Cook and Unger 
(2002) Developmental Cell 2, 385-387). Understanding the mechanisms of
resistance to leptin requires a more detailed characterization of the 
mechanisms involved in OB-R activation. 

OB-R is constitutively associated with Janus kinase 2 (JAK2). The binding of
JAK2 to the receptor is critical for the signaling by OB-R and has been
proposed as being involved in stabilizing the OB-R receptor dimers. Activation
by agonists is thought to cause a change in conformation in the
juxtamembrane region of the cytoplasmic tail of the OB-R. JAK2, which is
constitutively linked to the box1 motif in this region, is activated by
autophosphorylation and then phosphorylates the OB-RI receptor but not the
OB-Rs receptor. The phosphorylation of OB-RI allows anchoring of STAT
proteins, which bind to the receptor and are activated by phosphorylation of
tyrosine. The activated STAT proteins dimerize and translocate into the
nucleus in order to stimulate the transcription of genes via STAT response
elements (Tartaglia (1997) J Biol Chem 272, 6093-6096).

Recently, a second promoter for the leptin receptor has been discovered.
Interestingly, a second transcript is co-expressed with the OB-R messengers
from this promoter. This transcript has been observed in several species,
such as mice, rats, humans, yeast and C. elegans (Bailleul et al. (1997)
Nucleic Acids Res 25, 2752-2758). In situ hybridization experiments confirm
the coexpression of OB-R and of the associated gene in the brain of mice,
including the hypothalamic regions involved in regulating body weight (Mercer
et al., J Neuroendocrinol 2000 July; 12(7):649-55). The corresponding protein
is composed of 131 amino acids and is called OB-R-gene related protein
(OB-RGRP). This protein was the subject of patent application WO 98/05792.

The fact that OB-RGRP is expressed in yeast and nematodes, which are
organisms lacking leptin receptors, indicates a more general role for
OB-RGRP, supported by the deletion of this protein in yeast which causes a
deficiency in transport of proteins from the Golgi to the vacuoles
(Belgareh-Touze et al. (2002) Molecular Biology Of The Cell 13, 1694-1708).

In 2002, a cDNA called MY047 was cloned from a human brain cDNA library
(16). The function of the corresponding protein is still unknown. MY047
exhibits 68% homology with OB-RGRP, suggesting that these two proteins
belong to the same family. Analysis of the sequences available for the human
genome sequencing project shows that no other homolog exists.



The applicants have endeavored to determine the role of OB-RGRP and its
relationships with leptin receptors.

They have thus shown the specificity of the interactions between OB-RGRP
and the OB-Rs receptor.

They have also shown that it is possible to specifically modify the expression
of leptin receptors at the cell surface using antisense oligonucleotides
directed against the leptin receptor gene associated protein (OB-RGRP).



A subject of the present application is therefore optionally modified
oligonucleotides comprising from 8 to 50 nucleotides which hybridize
specifically with the sequence SEQ ID No. 1 and which inhibit OB-RGRP
expression.

Advantageously, these oligonucleotides promote the expression of leptin
receptors at the cell surface.

Preferentially, these oligonucleotides are antisense oligonucleotides.

Preferentially, these oligonucleotides comprise a sequence exhibiting at least
60%, 70%, 80% or 90% identity with the sequence SEQ ID No. 2.

According to an advantageous embodiment, in these oligonucleotides,
nucleotides are thioesterified.

According to another advantageous embodiment, in these oligonucleotides,
nucleotides are 2'-O-methylated.

According to another advantageous embodiment, these oligonucleotides have
a triethylene glycol residue at their 3' ends.

Although the most commonly used form of antisense compounds is in the
form of antisense oligonucleotides, the present invention includes
oligonucleotide derivatives and compounds which mimic their structure, such
as those described hereinafter, without this list being limiting. The antisense
compounds in agreement with this invention preferably comprise from 8 to 50
nucleobases (i.e. they are oligomers made up of 8 to 50 nucleotide units). The
antisense compounds particularly targeted are antisense oligonucleotides,
more especially those which are made up of approximately 12 to 30
nucleobases. The antisense compounds comprise ribozymes, oligozymes or
other short catalytic RNAs or catalytic oligonucleotides which hybridize with
the target nucleic acid and modulate its expression. A nucleoside is a
combination of a nitrogenous base and a sugar. The base of a nucleoside is
generally a heterocyclic nitrogenous base. The two most common types of
heterocyclic base are purine and pyrimidine bases. The nucleotides are
nucleosides which carry a phosphate group covalently bonded to the sugar of
the nucleoside. For the nucleosides comprising a pentofuranose, the
phosphate may be bonded to the hydroxyl at position 2', 3' or 5' of the sugar.
The formation of oligonucleotides comes from the covalent attachment of the
phosphate group to two adjacent nucleosides, which makes it possible, step
by step, to obtain a linear oligomer. The two ends of such a linear polymer
can, in turn, join together to form a circular structure, but the open structure is
generally preferred. In the oligonucleotide structure, the phosphate groups are
considered to form the internucleoside skeleton of the oligonucleotide. The
normal bond in the RNA or DNA skeleton is a 3'-5' phosphodiester linkage.
Specific examples of antisense compounds which can be used in this
invention include oligonucleotides containing a modified backbone or
unnatural internucleoside bonds. Thus, oligonucleotides with a modified
backbone comprise those which conserve a phosphorus atom in their skeleton
and those which are lacking therein. For the needs of the present invention,
modified oligonucleotides which do not have a phosphorus atom in their
internucleoside bond can, nevertheless, be considered to be oligonucleotides.
The backbone of these modified oligonucleotides may comprise, for example,
the following groups: phosphorothioates, chiral phosphorothioates,
phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl
and other alkyl phosphonates, including 3'-alkylene phosphonates, 5'-alkylene
phosphonates and chiral phosphonates, phosphinates, phosphoramidates,
including 3'-aminophosphoramidate and aminoalkylphosphoramidates,
thionophosphoramidates, thionoalkylphosphonates,
thionoalkylphosphotriesters, selenophosphates and borophosphates which
form normal 3'-5' bonds, and analogs thereof which form 2'-5' bonds, and also
those which exhibit a reverse polarity, i.e. comprising at least one
internucleoside bond of the 3'-3', 5'-5' or 2'-2' type. The form of
oligonucleotides having a reverse polarity which is preferentially used is that
in which the first internucleoside bond at the 3' end is of the 3'-3' type. This
corresponds to a single inverted nucleotide residue which may, moreover, be
abasic, i.e. in which the heterocyclic nitrogenous base is missing or replaced
with a hydroxyl group. The various forms (salt or free acid) are included in
the scope of this invention.

The backbone of the modified oligonucleotides lacking a phosphorus atom is
preferentially made up of short alkyl or cycloalkyl chains, including derivatives
thereof comprising one or more hetero atoms, acting as an internucleoside
bond. This type of backbone may be based on a morpholino bond (partly
consisting of the sugar of the nucleoside), on siloxane, on formacetyl and
thioformacetyl, on methylene formacetyl and methylene thioformacetyl, on
riboacetyl, on alkenes, on sulfamates, on sulfonate and sulfonamide, on
methyleneimine and methylene hydrazine, on amide, and on any other group
comprising various nitrogen, sulfur and oxygen atoms or methyl groups.

For other oligonucleotide analogs, the sugar and the internucleoside bond
(i.e. the backbone) are replaced at the same time in the nucleotide structure
with new groups. The heterocyclic nitrogenous base is conserved in order to
ensure hybridization with the target nucleic acid. Such oligomeric compounds,
PNAs (for Peptide Nucleic Acids), have shown an excellent capacity for
hybridization. In these compounds, the skeleton of the oligonucleotide is
replaced with an amide-based backbone, in particular with aminoethyl glycine,
grafted directly or indirectly onto the nitrogenous bases. In addition, thorough
teaching regarding these PNAs may be found in Nielsen et al., Science, 1991,
254, 1497.

The invention incorporates more particularly oligonucleotides with a
phosphorothioate, amide and morpholine backbone, and the oligonucleotides
with a hetero atom skeleton, more precisely:

-CH2-NH-O-CH2-

-CH2-N(CH3)-O-CH2- (called methylene(methylimino) or MMI skeleton)

-CH2-O-N(CH3)-CH2-

-CH2-N(CH3)-N(CH3)-CH2-

-O-N(CH3)-CH2-CH2- (in which the phosphodiester bridge is: O-P-O-CH2).

The modification of the oligonucleotides may also be carried out on the sugars:
the preferred substitutions are at position 2' (F; O-, N- or S-alkane, O-, N- or
S-alkene or O-, N- or S-alkyne derivatives of length C1 to C11, which may or
may not be substituted). In particular, the preferred derivatives are:

O-[(CH2)nO]m-CH3
O-(CH2)n-O-CH3
O-(CH2)n-NH2
O-(CH2)n-CH3
O-(CH2)n-O-NH2
O-(CH2)n-O-N[(CH2)n-CH3]2

in which n and m range from 1 to 10.

Other modifications of the 2' position include the following groups: aliphatic
chains, which may or may not be substituted, of length C1 to C10, aryl chains,
aryl-alkyl chains and alkyl-aryl chains; -SH, -SCH3, -OCN, -Cl, -Br, -CN, -CF3,
-OCF3, -SO2CH3, -ONO2, -NO2, -N3, -NH2; substituted silyls; "reporter" groups;
intercalating groups; RNA cleavage groups; groups which improve the
pharmacodynamic properties of an oligonucleotide. The preferred
modifications include the groups:
2'-methoxyethoxy (2'-O-CH2CH2OCH3, also called 2'-O-(2-methoxyethyl) or
2'-MOE) (Martin et al., Helv. Chim. Acta, 1995, 78, 486-504);
2'-dimethylaminooxyethoxy (O(CH2)2ON(CH3)2, also called 2'-DMAOE);
2'-dimethylaminoethoxyethoxy (2'-O-CH2OCH2-N(CH2)2, also called
2'-dimethylaminoethoxyethyl or 2'-DMAEOE).
Another advantageous modification leads to the formation of LNAs (Locked
Nucleic Acids) in which the hydroxyl at position 2' is attached to the carbon at
position 3' or 4' of the sugar, then forming a sugar with a bicyclic structure. The
preferred bridging occurs via a methylene or ethylene linkage between the 2'
oxygen and the 4' carbon.

Other preferred substitutions at position 2' include:

-O-CH3 (2'-methoxy)

-O-(CH2)3-NH2 (2'-aminopropoxy)

-CH2-CH=CH2 (2'-allyl)

-O-CH2-CH=CH2 (2'-O-allyl)

-F (2'-fluoro).

These modifications at 2' may be in the ribo (lower) or arabino (upper) position.
The 2'-fluoro substituent is the preferred one in the arabino position.
Similar modifications may be made on other positions, in particular at position 3' of
the sugar of the nucleotide at the 3'-terminal end or in the oligonucleotides with a
2'-5' backbone, and at position 5' of the sugar at the 5'-terminal end. The sugars
of the oligonucleotides can also be replaced with analogs (for example a
cyclobutyl can be substituted for a pentofuranosyl).

The oligonucleotides can also comprise modifications or substitutions on the
nucleobases (nitrogenous heterocyclic bases called "bases" by those skilled in the
art). The natural (unmodified) bases are purines (adenine A and guanine G) and
pyrimidines (cytosine C, thymine T and uracil U). Included among the modified
bases are natural or synthetic molecules such as 5-methylcytosine,
5-hydroxymethylcytosine, xanthine, hypoxanthine, 2-aminoadenine; 6-methyl,
2-methyl and other alkyl derivatives of purine bases (A and G); 2-thio derivatives
(C, T and U); 5-halo derivatives (U, C); 5-propynyl derivatives (U and C); 6-azo
derivatives (U, T and C); 5-uracil; 4-thiouracil; 8-halo, 8-amino, 8-thiol, 8-thioalkyl,
8-hydroxyl and other adenines and guanines substituted at position 8; 5-halo (in
particular 5-bromo), 5-trifluoromethyl and other uracils and cytosines substituted in
position 5; 7-methylguanine and 7-methyladenine; 2-fluoroadenine;
2-aminoadenine; 8-azaguanine and 8-azaadenine; 7-deazaguanine and
7-deazaadenine; 3-deazaguanine and 3-deazaadenine. Among the other modified
bases, tricyclic pyrimidines are found, such as phenoxazine cytidine
(1H-pyrimido[5,4-b][1,4]benzoxazin-2(3H)-one), phenothiazine cytidine
(1H-pyrimido[5,4-b][1,4]benzothiazin-2(3H)-one), substituted phenoxazine cytidine
(such as 9-(2-aminoethoxy)-H-pyrimido[5,4-b][1,4]benzoxazin-2(3H)-one), or
carbazole cytidine (2H-pyrimido[4,5-b]indol-2-one).

The modified bases comprise the compounds in which the purine or pyrimidine
heterocycle is replaced with another heterocycle, for example 7-deazaadenine,
7-deazaguanosine, 2-aminopyridine or 2-pyridone (The Concise Encyclopedia Of
Polymer Science And Engineering, pages 858-859, Kroschwitz, J. I., ed. John
Wiley & Sons, 1990; Englisch et al., Angewandte Chemie, International Edition,
1991, 30, 613; Sanghvi, Y.S., Chapter 15, Antisense Research and Applications,
pages 289-302, Crooke, S.T. and Lebleu, B., ed., CRC Press, 1993). Some of
these modified bases may be of great value for increasing the affinity of the
oligomeric compounds of the invention, such as pyrimidines substituted at position
5, azapyrimidines, or N- and O-substituted purines (such as
2-aminopropyladenine, 5-propynyl uracil, 5-propynyl cytosine). The substituted
5-methylcytosines have a positive effect on the stability of oligomer-nucleic acid
duplexes (Sanghvi, Y.S., Crooke, S.T. and Lebleu, B., eds., Antisense Research
and Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and are the
preferred substitution, in particular in combination with 2'-methoxyethyl
modifications of the sugars.

Preferentially, these oligonucleotides are in the single-stranded form. 

According to a particularly advantageous embodiment, these oligonucleotides
comprise a sequence exhibiting at least 60% identity with the sequence SEQ
ID No. 2, in which the nucleotides at positions 2, 4, 6, 7, 9, 11, 13, 15, 17, 19
and 20, in the 5' to 3' direction, are thioesterified.

According to a particularly advantageous embodiment, these oligonucleotides
comprise a sequence exhibiting at least 60% identity with the sequence SEQ
ID No. 2, in which the nucleotides at positions 1, 2, 3, 4, 5, 16, 17, 18, 19
and/or 20, in the 5' to 3' direction, are 2'-O-methylated.
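The two position lists above can be applied to a 20-mer as in the following minimal Python sketch. The function name `annotate`, the `m` prefix and `*` suffix notation, and the example sequence are assumptions introduced for illustration only; the example sequence is a placeholder and is not SEQ ID No. 2.

```python
# Positions are counted in the 5' to 3' direction, starting at 1.
THIO_POSITIONS = {2, 4, 6, 7, 9, 11, 13, 15, 17, 19, 20}
METHYL_POSITIONS = {1, 2, 3, 4, 5, 16, 17, 18, 19, 20}


def annotate(oligo: str) -> list[str]:
    """Return one label per nucleotide.

    Hypothetical notation: an 'm' prefix marks a 2'-O-methylated residue
    and a '*' suffix marks a thioesterified residue.
    """
    if len(oligo) != 20:
        raise ValueError("the position lists above assume a 20-mer")
    labels = []
    for position, base in enumerate(oligo.upper(), start=1):
        prefix = "m" if position in METHYL_POSITIONS else ""
        suffix = "*" if position in THIO_POSITIONS else ""
        labels.append(f"{prefix}{base}{suffix}")
    return labels


# Placeholder sequence for illustration, NOT SEQ ID No. 2.
EXAMPLE = "ACGTACGTACGTACGTACGT"
print(" ".join(annotate(EXAMPLE)))
```

Positions 2, 4, 17, 19 and 20 appear in both lists, so in this sketch those residues carry both marks.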

Preferentially, the oligonucleotides according to the present invention are
DNAs.

A subject of the present invention is also oligonucleotides of the iRNA
(Interfering Ribonucleic Acid) type comprising from 15 to 25 nucleotides, which
hybridize specifically to the sequence SEQ ID No. 21 and which inhibit the
expression of OB-RGRP.

Preferentially, such iRNAs comprise 17 or 19 nucleotides taken continuously
from the sequence SEQ ID No. 21, or from the sequence complementary
thereto. Nucleotides A(A/G) and (C/T)T can be added respectively in 5' and in
3' of this sequence of 17 or 19 nucleotides. Other types of residues or
chemical groups can, however, be added to these two ends, provided that they
do not decrease the activity of the antisenses.
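As an illustration of the selection rule just described, the following Python sketch enumerates contiguous cores of 17 or 19 nucleotides and adds the A(A/G) and (C/T)T flanks. The function names and the target sequence are assumptions made for the example; the target is a placeholder standing in for SEQ ID No. 21, which is not reproduced here.

```python
from itertools import product


def core_windows(target: str, length: int) -> list[str]:
    """All contiguous stretches of `length` nucleotides in `target`."""
    if length not in (17, 19):
        raise ValueError("the text specifies cores of 17 or 19 nucleotides")
    return [target[i:i + length] for i in range(len(target) - length + 1)]


def flanked_variants(core: str) -> list[str]:
    """Add A(A/G) at the 5' end and (C/T)T at the 3' end of a core."""
    five_prime = ["AA", "AG"]
    three_prime = ["CT", "TT"]
    return [left + core + right
            for left, right in product(five_prime, three_prime)]


# Placeholder 25-nt target for illustration, NOT SEQ ID No. 21.
TARGET = "GATTACAGATTACAGATTACAGATT"
cores = core_windows(TARGET, 19)
print(len(cores), "cores of 19 nt")
print(flanked_variants(cores[0]))
```

Each core yields four flanked variants (two choices at each end), which gives molecules of 21 or 23 nucleotides, within the 15 to 25 range stated above.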

The nucleotide modifications described for the antisenses are also possible for
those making up the composition of the siRNAs.

The present invention also includes any modifications of the antisenses or of
the iRNAs which are directed toward increasing the resistance of these
compounds to cellular nucleases, or their penetration into cells and/or their
effectiveness in targeting the OB-RGRP sequence.

When they are DNAs, the oligonucleotides according to the present invention can
be produced conveniently and routinely by the well-known technique of
solid-phase synthesis. The equipment for such synthesis is sold by various
specialized companies, such as Applied Biosystems (Foster City, CA). The
synthesis of the antisenses in the present invention makes use of chemical
synthesis on a suitable support according to methods known to those skilled in
the art, in particular described by E. Uhlmann, A. Peyman, A. Ryte, A. Schmidt
and E. Buddecke (1999, Methods in Enzymology 313: 268-284) and by E. Uhlmann
(Recent advances in the medicinal chemistry of antisense oligonucleotides,
Current Opinion in Drug Discovery and Development 3: 203-213, 2000). Any other
method of synthesis known to those skilled in the art may also be used.

When they are iRNAs, the oligonucleotides according to the present invention can
be synthesized by chemical synthesis, when they are synthetic iRNAs, or
expressed in situ using vectors expressing such oligonucleotides.

siRNAs (small iRNAs) can be obtained from various suppliers, such as Proligo
(Proligo France SAS, 1 rue Robert et Sonia Delaunay, 75011 Paris), Dharmacon
(Dharmacon, Inc., 1376 Miners Drive #101, Lafayette, CO 80026) and Ambion
(Ambion (Europe) Ltd., Ermine Business Park, Spitfire Close, Huntingdon,
Cambridgeshire PE29 6XY, United Kingdom), or can be synthesized using kits
marketed by various companies, such as Dharmacon and Ambion.

Preferentially, the iRNAs according to the present invention are in
double-stranded form.

After synthesis, the iRNAs are first of all taken up in RNAse-free water. The
pairing of the two single-stranded molecules can be carried out as follows:
20 µmol.L⁻¹ of each strand are mixed in the pairing buffer (100 mmol.L⁻¹ of
potassium acetate, 30 mmol.L⁻¹ of HEPES-KOH, pH 7.4, 2 mmol.L⁻¹ of
magnesium acetate) and then heated at 90°C for 1 min, followed by incubation for
1 h at 37°C.
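By way of a worked example, the volume of each strand needed to reach 20 µmol.L⁻¹ in the pairing reaction follows from C1 x V1 = C2 x V2. The sketch below is only illustrative: the stock concentration (100 µmol.L⁻¹) and the reaction volume (50 µl) are assumed values that do not come from the text, and the helper name is hypothetical.

```python
def strand_volume_ul(stock_umol_per_l: float, final_volume_ul: float,
                     target_umol_per_l: float = 20.0) -> float:
    """Volume of strand stock (in µl) giving the target concentration.

    Uses C1 * V1 = C2 * V2, with all concentrations in µmol/L.
    """
    if stock_umol_per_l < target_umol_per_l:
        raise ValueError("stock is more dilute than the target concentration")
    return target_umol_per_l * final_volume_ul / stock_umol_per_l


# Assumed values: 100 µmol/L stocks and a 50 µl pairing reaction.
per_strand = strand_volume_ul(stock_umol_per_l=100.0, final_volume_ul=50.0)
remaining_volume = 50.0 - 2 * per_strand
print(per_strand, remaining_volume)
```

The remaining volume is made up with pairing buffer, which has to be prepared at a correspondingly higher concentration so that the final salt and HEPES concentrations match those stated above.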

Transfection of the siRNAs can be carried out using the same protocol as for 
transfection of the antisenses. 

An alternative for the iRNA is the use of vectors which allow synthesis of
antisense RNAs specific for the gene to be silenced and which will pair in the
transfected cells to give an siRNA. A first vector system allows expression of an
antisense sequence by two promoters in opposite direction, on each side of this
sequence, producing two complementary RNAs which will pair in the transfected
cells and give an siRNA. Another vector system uses the synthesis of an RNA
having the sequence of the antisense followed by the sense sequence, a few
nucleotides apart, which will create a stem-loop RNA structure which will be
cleaved in the transfected cells to give an siRNA. These vectors are transfected
conventionally as described above for the various DNAs. Stable lines which
exhibit a knockout of the target gene can be obtained by antibiotic selection
conventionally used to obtain lines.
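The second vector system (antisense, a short spacer, then sense) can be sketched as follows in Python. This is a minimal illustration under stated assumptions: the function names are hypothetical, the 9-nucleotide spacer is an arbitrary choice made for the example, and the input is a placeholder rather than an OB-RGRP sequence.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def hairpin_insert(sense: str, loop: str = "TTCAAGAGA") -> str:
    """DNA template for a stem-loop transcript: antisense + loop + sense.

    The default loop is an assumed spacer, not one specified in the text.
    """
    sense = sense.upper()
    antisense = reverse_complement(sense)
    return antisense + loop + sense


# Placeholder 19-nt sense sequence for illustration only.
SENSE = "GATTACAGATTACAGATTA"
insert = hairpin_insert(SENSE)
print(insert)
```

Because the two arms are reverse complements of each other, the transcript can fold back on itself to form the stem, with the spacer forming the loop.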

In general, those skilled in the art may refer, for the iRNAs, to the following
publications: Elbashir S.M. et al. (2001, Nature 411: 494-498), Elbashir S.M.,
Lendeckel W. and Tuschl T. (2001, Genes & Dev. 15: 188-200) and Masters J.R.
et al. (2001, Proc. Natl. Acad. Sci. USA 98: 8012-8017).

Vectors which allow the expression of iRNAs can be obtained as described by
Brummelkamp T.R., Bernards R., Agami R. (2002, Science 296: 550-553) and Yu
J.Y., DeRuiter S.L. and Turner D. (2002, Proc. Natl. Acad. Sci. USA 99:
6047-6052).

Such vectors, and also cells containing such vectors, are subjects of the 
present application. 

A subject of the present application is also medicinal products containing such
oligonucleotides, vectors and cells, and pharmaceutical compositions
containing a pharmacologically active amount of such oligonucleotides, vectors
and cells and pharmaceutically acceptable excipients.

Another subject of the present invention is the use of such oligonucleotides,
vectors and cells, for producing a medicinal product for preventing and/or treating
leptin-related pathological conditions.

A subject of the invention is also a method of curative or preventive treatment of
leptin-related diseases, consisting in administering such oligonucleotides, vectors
and cells to a patient suffering from said disease.

Another subject of the invention is a method for determining the modification,
by a compound, of the interaction between the OB-RGRP or the MY047
protein, or a protein exhibiting at least 65% identity with this protein or with the
MY047 protein, and the leptin receptor.

It also relates to fusion proteins for implementing this method, and also to nucleic
acids encoding these proteins.

A subject of the invention is also a method of curative or preventive treatment of
leptin-related diseases, consisting in administering a ligand selected using the
method defined above to a patient suffering from said disease.

A first subject of the present invention is therefore a fusion protein which is
composed of a sequence exhibiting at least 65% identity with the sequence SEQ
ID No. 4, or the sequence SEQ ID No. 16, or of a substantial part of the sequence
SEQ ID No. 4 or of the sequence SEQ ID No. 16, and of an energy-donor or
energy-acceptor protein, or of a substantial and active part of an energy-donor or
energy-acceptor protein.

The fusion proteins according to the present invention are composed in substance
of a component corresponding to part or all of a sequence exhibiting at least 65%,
preferentially at least 75%, and even more preferentially at least 85% or 95%,
identity with the sequence SEQ ID No. 4 or the sequence SEQ ID No. 16, or of a
substantial part of the sequence SEQ ID No. 4 or of the sequence SEQ ID No. 16,
and of a component corresponding to an energy-donor or -acceptor protein. They
may, however, comprise other amino acid sequences, derived from other proteins,
such as signal sequences.

Advantageously, the energy-donor protein is Renilla luciferase. It may, however,
be any other energy-donor protein such that the emission spectrum of the donor
overlaps the excitation spectrum of the acceptor sufficiently to allow efficient
energy transfer between the two partners. It may thus be GFP, if the energy
transfer is FRET, or else aequorin if the energy transfer is CRET. Aequorin can be
obtained and used as described in patent application EP 0 187 519, or in the
article by Inouye et al. (PNAS USA 82: 3154-3158 (1985)).

As regards the energy-acceptor fluorescent protein, it is preferentially DsRed,
GFP or a mutant of this protein, such as YFP, EYFP, wild-type GFP, GFPS65T,
Topaz or GFP10.

It may however be any other energy-acceptor fluorescent protein such that the
excitation spectrum of the acceptor and the emission spectrum of the donor
overlap sufficiently to allow efficient energy transfer between the two partners.

These proteins are known to those skilled in the art, who can find their sequences
in the literature, in particular in the review by Blinks et al. (Pharmacol. Rev. 28:
1-93 (1976)). In particular, GFP is described by Tsien (Annu. Rev. Biochem. 67:
509-544 (1998)) and the cloning thereof is described by Prasher et al. (Gene 111:
229-233 (1992)). As regards the cloning of DsRed, it is described by Matz et al.
(Nat. Biotechnol. 17: 969-973 (1999)). For Rluc, those skilled in the art can refer
to Blinks et al. (Pharmacol. Rev. 28: 1-93 (1976)) or else to Lorenz et al. (PNAS
88: 4438-4442 (1991)).



Particularly advantageously, the donor and acceptor fusion proteins have one of
the sequences SEQ ID No. 6, SEQ ID No. 8, SEQ ID No. 12, SEQ ID No. 14,
SEQ ID No. 18 or SEQ ID No. 20, or a variant of this sequence exhibiting at least
65% identity.



Other subjects of the present invention are nucleic acids encoding these proteins.
Such nucleic acids may be complementary or genomic DNAs, or RNAs. These
nucleic acids or polynucleotides can be in single-chain form or in the form of
duplex.

They are particularly advantageously complementary DNAs.

Preferentially, a subject of the invention is a nucleic acid having at least 65%,
preferentially at least 75%, and even more preferentially at least 85% or 95%,
nucleotide identity with a nucleic acid of sequence SEQ ID No. 5, SEQ ID No. 7,
SEQ ID No. 11, SEQ ID No. 13, SEQ ID No. 17 or SEQ ID No. 19.

According to yet another aspect, the invention relates to a nucleic acid which
hybridizes, under high stringency hybridization conditions, with a nucleic acid as
defined above, and more particularly a nucleic acid of nucleotide sequence
SEQ ID No. 5, SEQ ID No. 7, SEQ ID No. 11, SEQ ID No. 13, SEQ ID No. 17 or
SEQ ID No. 19, or a nucleic acid of complementary sequence.

For the purpose of the present invention, the "percentage identity" between two
nucleotide or amino acid sequences can be determined by comparing two
optimally aligned sequences through a window of comparison.

The part of the nucleotide sequence or polypeptide in the window of
comparison may thus comprise additions or deletions (for example gaps)
compared to the reference sequence (which does not comprise these additions or
these deletions) so as to obtain optimal alignment of the two sequences.

The percentage is calculated by determining the number of positions at
which an identical nucleic acid base or amino acid residue is observed for the two
(nucleic acid or peptide) sequences compared, dividing the number of positions
at which there is identity between the two bases or amino acid residues by the
total number of positions in the window of comparison, and then multiplying the
result by 100 in order to obtain the percentage sequence identity.
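The calculation just described can be sketched in a few lines of Python. The sketch assumes that the two sequences have already been optimally aligned by other means and that gaps are written as "-"; the function name and the example sequences are introduced here for illustration only.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage identity over a window of comparison.

    Both inputs are assumed to be pre-aligned and of equal length, with
    '-' marking a gap. A gap never counts as an identical position, but
    every position of the window counts in the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("the window of comparison is empty")
    identical = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * identical / len(aligned_a)


# Placeholder alignment for illustration: one gap and one mismatch.
print(percent_identity("ACGT-ACGT", "ACGTTACGA"))
```

In this example, 7 of the 9 positions in the window are identical, so the result is 7/9 x 100, i.e. approximately 77.8%.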

The optimal alignment of the sequences for comparison can be produced
on a computer using known algorithms contained in the WISCONSIN GENETICS
SOFTWARE PACKAGE, GENETICS COMPUTER GROUP (GCG), 575 Science
Drive, Madison, WISCONSIN.

By way of illustration, the percentage sequence identity may be determined
using the BLAST software (versions BLAST 1.4.9 of March 1996, BLAST 2.0.4 of
February 1998 and BLAST 2.0.6 of September 1998), using exclusively the
default parameters (S. F. Altschul et al., J. Mol. Biol. 1990 215: 403-410, S. F.
Altschul et al., Nucleic Acids Res. 1997 25: 3389-3402). BLAST searches for
sequences similar/homologous to a reference "request" sequence using the
algorithm of Altschul et al. The request sequence and the databases used may be
peptide-based or nucleic acid-based, any combination being possible.

For the purpose of the present invention, the expression "high stringency
hybridization conditions" will be intended to mean the following conditions:

1- Membrane competition and prehybridization:

- Mix: 40 µl of salmon sperm DNA (10 mg/ml) + 40 µl of human placental DNA
(10 mg/ml).

- Denature for 5 min at 96°C, then immerse the mixture in ice.

- Remove the 2X SSC and pour 4 ml of formamide mix into the hybridization tube
containing the membranes.

- Add the mixture of the two denatured DNAs.

- Incubate at 42°C for 5 to 6 hours, with rotation.

2- Labeled probe competition:

- Add 10 to 50 µl of Cot I DNA to the labeled and purified probe, depending on the
amount of repetition.

- Denature for 7 to 10 min at 95°C.

- Incubate at 65°C for 2 to 5 hours.

3- Hybridization:

- Remove the prehybridization mix.

- Mix 40 µl of salmon sperm DNA + 40 µl of human placental DNA; denature for 5
min at 96°C, then immerse in ice.

- Add 4 ml of formamide mix, the mixture of the two DNAs and the denatured Cot I
DNA/labeled probe to the hybridization tube.

- Incubate for 15 to 20 hours at 42°C, with rotation.

4- Washes:

- One wash at ambient temperature in 2X SSC, to rinse.

- 2 times 5 minutes at ambient temperature in 2X SSC and 0.1% SDS at 65°C.

- 2 times 15 minutes at 65°C in 1X SSC and 0.1% SDS at 65°C.

Wrap the membranes in Saran wrap and expose.

The hybridization conditions described above are suitable for hybridization,
under high stringency conditions, of a nucleic acid molecule whose length ranges
from 20 nucleotides to several hundred nucleotides.

It goes without saying that the hybridization conditions described above can
be adjusted as a function of the length of the nucleic acid the hybridization of
which is desired, or of the type of labeling chosen, according to the techniques
well known to those skilled in the art.

The suitable hybridization conditions may, for example, be adjusted according to
the teaching contained in the work by HAMES and HIGGINS (1985, "Nucleic acid
hybridization: a practical approach", Hames and Higgins Ed., IRL Press, Oxford)
or else in the work by F. AUSUBEL et al. (1989, Current Protocols in Molecular
Biology, Greene Publishing Associates and Wiley Interscience, N.Y.).

The proteins which are the subjects of the present invention can be obtained by
any means known to those skilled in the art. They are, however, advantageously
obtained by expression of the nucleic acids described above, encoding these
proteins, optionally inserted into expression vectors, in suitably chosen
cells, optionally followed by an extraction and a purification which may be total
or partial.

The invention also relates to a recombinant vector comprising a nucleic
acid according to the invention.

Advantageously, such a recombinant vector will comprise a nucleic acid
chosen from the following nucleic acids:

a) a nucleic acid encoding a protein having at least 65% amino acid identity
with a sequence SEQ ID No. 6, SEQ ID No. 8, SEQ ID No. 18 or SEQ ID No. 20,
or a peptide fragment or a variant thereof;

b) a nucleic acid comprising a polynucleotide having a sequence SEQ ID
No. 5, SEQ ID No. 7, SEQ ID No. 17 or SEQ ID No. 19, or a fragment or a variant
thereof;

c) a nucleic acid having at least 65% nucleotide identity with a nucleic acid
having a sequence SEQ ID No. 5, SEQ ID No. 7, SEQ ID No. 17 or
SEQ ID No. 19, or a fragment or a variant thereof;

d) a nucleic acid which hybridizes, under high stringency hybridization
conditions, with a nucleic acid of sequence SEQ ID No. 5, SEQ ID No. 7, SEQ ID
No. 17 or SEQ ID No. 19, or a fragment or a variant thereof.



For the purposes of the present invention, the term "vector" will be intended
to mean a circular or linear DNA or RNA molecule which is indifferently in
single-stranded or double-stranded form.

According to one embodiment, the expression vector comprises, besides a
nucleic acid in accordance with the invention, regulatory sequences which make it
possible to direct the transcription and/or the translation thereof.

According to an advantageous embodiment, a recombinant vector
according to the invention will in particular comprise the following elements:

(1) elements for regulating the expression of the nucleic acid to be inserted, 
such as promoters and enhancers; 

(2) the coding sequence included in the nucleic acid in accordance with the
invention to be inserted into such a vector, said coding sequence being placed in
phase with the regulatory signals described in (1); and

(3) suitable transcription initiation and stop sequences.

In addition, the recombinant vectors according to the invention may include
one or more origins of replication in the cellular hosts in which their amplification
or their expression is desired, markers or selection markers.

By way of examples, the promoters for eukaryotic cells will comprise the
thymidine kinase promoter of the HSV virus or else the mouse metallothionein-I
promoter.

In general, in choosing a suitable promoter, those skilled in the art may
advantageously refer to the work by SAMBROOK et al. (1989, "Molecular Cloning:
A Laboratory Manual," 2nd ed., Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, NY) or else to the techniques described by FULLER et al. (1996,
Immunology in Current Protocols in Molecular Biology, Ausubel et al.).

The preferred vectors according to the invention are plasmids, such as, for
example, the vectors pCDNA3 (Invitrogen), pQE70, pQE60, pQE9 (Qiagen),
psiX174, pBluescript SA, pNH8A, pNH16A, pNH18A, pNH46A, pWLNEO,
pSV2CAT, pOG44, pXT1 and pSG (Stratagene).

They may also be vectors of the baculovirus type, such as the vector
pVL1392/1393 (Pharmingen) used to transfect cells of the Sf9 line (ATCC No.
CRL 1711) derived from Spodoptera frugiperda.

They may also be adenoviral vectors, such as human adenovirus type 2 or 5.

A recombinant vector according to the invention may also be a retroviral
vector or else an adeno-associated vector (AAV). Such adeno-associated vectors
are, for example, described by FLOTTE et al. (1992, Am. J. Respir. Cell Mol. Biol.,
7: 349-356).

Objects of the present invention are also cells comprising a protein, a nucleic acid
or a vector as described above, or fragments of these cells, lysates of these cells
or else membranes of these cells.

Such cells may be cells isolated from an organism and cultured in a suitable
growth medium. They are, however, preferentially cell lines. Thus, such lines are
particularly advantageously the cell lines HEK 293, COS (ATCC No. CRL 1650),
COS-M6 and HeLa (ATCC No. CCL2), or else CV-1 (ATCC No. CCL70), Sf-9
(ATCC No. CRL 1711), CHO (ATCC No. CCL-61) or 3T3 (ATCC No. CRL-6361).

The membranes of these cells can be prepared by any method known to those
skilled in the art.

Preferentially, they will be prepared by mechanical grinding of the cells and then
centrifugation of the suspensions obtained, as illustrated in the examples which
follow.

The present invention also relates to compositions comprising cells as described
above and saponin.

The present invention also relates to a method for determining the
modification, by a compound, of the interaction between the OB-RGRP, the
MY047 protein or a protein exhibiting at least 65% identity with the
sequence SEQ ID No. 4 or the sequence SEQ ID No. 16, and the leptin
receptor, comprising the steps consisting in:

- bringing said compound into contact with a protein exhibiting at least 65%
identity with the sequence SEQ ID No. 4 or the sequence SEQ ID No. 16,
and the leptin receptor, or cells, or fragments or lysates or membranes of
cells, comprising such proteins, and optionally a suitable enzyme substrate,
and

- measuring the interaction between a protein exhibiting at least 65%
identity with the sequence SEQ ID No. 4 or the sequence SEQ ID No.
16, and the leptin receptor.

Preferentially, said compound is brought into contact with an energy-donor fusion 
protein and an energy-acceptor fusion protein, or cells, or fragments or lysates or 
membranes of cells, comprising such a protein, and optionally a suitable enzyme 
substrate. 

Preferentially, said compound is brought into contact with cells treated with an agent
which permeabilizes these cells, such as saponin.

The energy-donor fusion proteins and the energy-acceptor fusion proteins are
chosen such that the energy resulting from the activation of the donor may be
transferred efficiently to the acceptor.

In an advantageous embodiment of said method, the energy-donor fusion protein
is a protein from fusion with luciferase or a substantial part of luciferase, in which
case the substrate is advantageously coelenterazine.

In a preferential embodiment of said method, the energy-acceptor fusion protein is
a protein from fusion with YFP or a substantial part of YFP.

In an advantageous embodiment of said method, the energy transfer
measured in the presence of the test compound is compared to that
measured in the absence of the test compound.

In another advantageous embodiment of said method, the energy transfer
measured in the presence of the test compound and of leptin (or a ligand of
the receptor) is compared to that measured in the presence of the
compound in the absence of leptin (or a ligand of the receptor).
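
The two comparisons above can be illustrated by a short sketch. The description only states that the two energy-transfer measurements are compared; expressing the comparison as a percentage change relative to the reference condition, as well as the function and parameter names, are assumptions chosen for this example.

```python
def energy_transfer_change(test_signal: float, reference_signal: float) -> float:
    """Percentage change of the energy-transfer signal relative to a reference.

    test_signal      -- transfer measured in the test condition, for example
                        in the presence of the test compound (or of leptin)
    reference_signal -- transfer measured in the matching reference condition,
                        for example in the absence of the compound (or of leptin)

    The percentage-change metric is an assumption of this sketch; any other
    comparison (difference, ratio) could be substituted.
    """
    if reference_signal == 0:
        raise ValueError("reference signal must be non-zero")
    return 100.0 * (test_signal - reference_signal) / reference_signal


if __name__ == "__main__":
    # Hypothetical readings, in arbitrary energy-transfer units.
    print(energy_transfer_change(test_signal=150.0, reference_signal=100.0))
```

A positive value would indicate that the tested condition increases the measured energy transfer and a negative value that it decreases it.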



Preferentially, the method is carried out on cell membranes, as described above.

Preferentially, the donor and acceptor proteins according to the present invention
are chosen such that the energy transfer takes place by first or second generation
BRET (for Bioluminescence Resonance Energy Transfer) or LRET (for
Luminescence Resonance Energy Transfer). However, such an energy transfer
may be effected by FRET (for Fluorescence Resonance Energy Transfer) or else
by CRET (for Chemiluminescence Resonance Energy Transfer).

Whatever the type of energy transfer, the energy-donor fusion protein/energy-
acceptor fusion protein pairs are chosen so as to allow such transfer.

BRET2 (2nd generation) consists of energy transfer between Renilla luciferase and
a mutant GFP, GFP10, using a suitable substrate, DeepBlueC™ coelenterazine
(Biosignal Packard).

CRET consists of energy transfer between aequorin, which is a luciferase, and
GFP.

FRET consists of energy transfer between two proteins of the GFP family having
different spectra.

To implement these transfers, those skilled in the art may refer to D. Ramsay et
al. (Biochem J 365: 429-40 (2002)) and to K. Yoshioka et al. (FEBS Lett 523: 147-
151 (2002)) for BRET2, to Baubet et al. (PNAS USA 97: 7260-7265 (2000)) for
CRET, and to Matyus (J Photochem Photobiol B 12: 323-337 (1992)) and Pollok
and Heim (Trends Cell Biol 9: 57-60 (1999)) for FRET.

Another subject of the present invention is a method for screening or
detecting compounds intended for the prevention and/or treatment of leptin-
related pathological conditions, comprising the steps consisting in:

- bringing said compound into contact with a protein exhibiting at least 65%
identity with the sequence SEQ ID No. 4 or the sequence SEQ ID No. 16,
and the leptin receptor, or cells, or fragments or lysates or membranes of
cells, comprising such proteins, and optionally a suitable enzyme substrate,
and

- measuring the interaction between a protein exhibiting at least 65%
identity with the sequence SEQ ID No. 4 or the sequence SEQ ID No. 16,
and the leptin receptor.



Preferentially, the protein exhibiting at least 65% identity with the sequence
SEQ ID No. 4 or the sequence SEQ ID No. 16 is the OB-RGRP or MY047.

The method according to the present invention is compatible with the 96-well or
384-well plates generally used. It does not require the use of radioactive
molecules, yet is sensitive, reproducible and rapid, and the result is easy to read.
These characteristics are particularly advantageous for carrying out large scale
screening.

The present invention also relates to the use of compounds selected using a
method consisting in:

- bringing said compound into contact with a protein exhibiting at least 65%
identity with the sequence SEQ ID No. 4 or the sequence SEQ ID No. 16,
and the leptin receptor, or cells, or fragments or lysates or membranes of
cells, comprising such proteins, and optionally a suitable enzyme substrate,
and

- measuring the interaction between a protein exhibiting at least 65%
identity with the sequence SEQ ID No. 4 or the sequence SEQ ID No.
16, and the leptin receptor.

A subject of the present invention is, finally, a method of curative or preventive
treatment of leptin-related diseases or diseases related to its receptor, comprising
the steps of:

- selecting a compound using a method consisting in:

+ bringing said compound into contact with a protein exhibiting at least 65%
identity with the sequence SEQ ID No. 4 or the sequence SEQ ID No. 16,
and the leptin receptor, or cells, or fragments, or lysates or membranes of
cells, comprising such proteins, and optionally a suitable enzyme substrate,
and

+ measuring the interaction between a protein exhibiting at least 65% identity with
the sequence SEQ ID No. 4 or the sequence SEQ ID No. 16, and the leptin
receptor;

- administering said compound to a patient suffering from said disease.



Leptin-related pathological conditions may be diseases related to a decrease in
bone density, such as, for example, osteoporosis, or, conversely, those related to
considerable calcification.

They may also be diseases which have an effect on weight, such as obesity,
diabetes or anorexia.

They may also be diseases which have an effect on sexual maturation,
hematopoiesis, angiogenesis, thrombus formation, the regulation of immunity and
inflammation, fetal development and cancer.

The compounds of the invention, oligonucleotides, iRNAs, or other compounds,
may be formulated in pharmaceutical compositions for the purpose of topical, oral,
parenteral, intranasal, intravenous, intramuscular, subcutaneous, intraocular
administration, etc. Preferentially, the pharmaceutical compositions contain
pharmaceutically acceptable vehicles for an injectable formulation. They may in
particular be isotonic, sterile, saline (monosodium or disodium phosphate, sodium,
potassium, calcium or magnesium chloride, etc., or mixtures of such salts)
solutions, or dry, in particular lyophilized, compositions which, by addition, as
appropriate, of sterilized water or of physiological saline, make it possible to
constitute injectable solutes.

The formulation of therapeutic compositions and their administration fall within the
competence of those skilled in the art.

The formulation of the compounds may include various products known to those
skilled in the art. Preferentially, the compounds may, for example, have salts, such
as sodium, potassium, ammonium, magnesium, calcium, polyamines, or
hydrochloric, hydrobromic, sulfuric, phosphoric or nitric acid, added to them. Other
salts can also be used, such as those originating from acetic, oxalic, tartaric,
succinic, maleic, fumaric, gluconic, citric, malic, ascorbic, benzoic, tannic, palmitic,
alginic, polyglutamic, naphthalenesulfonic, methanesulfonic, p-toluenesulfonic,
naphthalenedisulfonic or polygalacturonic acid. Finally, chlorine, bromine and
iodine salts can also preferentially be used.

The composition and the formulation for topical administration can include
transdermal patches, ointments, lotions, creams, gels, drops, suppositories,
sprays, liquids and powders.

The composition and the formulation for oral administration can include powders,
granules, microparticles, nanoparticles, suspensions, solutions, which may or may
not be aqueous, capsules, gelatin capsules, sachets, tablets or mini tablets.
Thickeners, flavors, diluents, emulsifiers, dispersing agents or binders may be
added.

The composition and the formulation for parenteral, intrathecal or
intraventricular administration can include sterile aqueous solutions which can
also contain buffers, diluents and other additives, such as, but not limited to,
penetration-increasing agents, transporting products and excipients.

The composition can be formulated and used as a foam, an emulsion, a
microemulsion, cationic, pH-sensitive or negatively charged liposomes, and
transfersomes.

In general, the various formulations can contain a mixture of one or more agents,
such as, but not limited to, agents which increase the penetration of the
compound (surfactants, bile salts, chelating agents, non-chelating surfactants),
excipients (binders, fillers, lubricants, disintegrating agents, wetting agents), or
transporters (water, saline solutions, alcohols, polyethylene glycol, gelatin,
lactose, amylose, magnesium stearate, talc, silicic acid, viscous paraffin,
hydroxymethylcellulose, polyvinylpyrrolidone). Other components can be added,
such as dyes, flavors, preserving agents, antioxidants, opacifiers, thickeners and
stabilizers.

The dosage depends on the severity and on the sensitivity of the state of the
disease to be treated, with a treatment period possibly ranging from a few days to
a few months, or until the treatment is effective or a reduction in the disease is
observed. The optimum dosage can be calculated from measurements of
accumulation of the therapeutic agent in the patient's body. Those skilled in the art
can easily determine the optimum dosages, the methods of dosage and the rates
of repetition of these dosages. The optimum dosages can vary as a function of the
relative effectiveness of each oligonucleotide or iRNA, and can, in general, be
estimated by measuring the EC50s of the doses used in vitro and in vivo in animal
models. In general, the dosage is between 0.01 µg and 100 g per kilo of
bodyweight and can be administered one or more times, daily, weekly, monthly or
annually, or even once every 2 to 20 years.

Competent individuals can easily determine the rate of repetition of the dosages
based on the amount of time the compound is present in the body fluids or the
tissues. Subsequent to a successful treatment, it may be desirable for the patient
to continue a maintenance therapy in order to prevent reappearance of the
disease; to do this, the oligonucleotide or the iRNA is administered at
maintenance doses ranging from 0.01 µg to 100 g per kilo of bodyweight, one or
more times a day, up to once every 20 years.

The administration of antisense in vivo has been carried out successfully by
various authors, using protocols of simple injection of antisense intravenously (He
et al. (1998) Zhonghua Shi Yan He Lin Chuang Bing Du Xue Za Zhi 12:1-4) or
intracerebrally (Yoburn et al. (2003) Synapse 47: 109-116, Tischkau et al. (2003)
J. Biol. Chem. 278: 718-723). In the last two years, more complex systems for
targeting antisense in the organism have been developed and used successfully
(Morishita et al. (2002) J. Endocrinol. 175: 475-485, Bartsch et al. (2002) Pharm.
Res. 19: 676-680), making it possible, in mice and rats, to treat various cancers
(Rait et al. (2002) Mol. Med. 8: 475-486, Ochietti et al. (2002) J. Drug. Target 10:
113-121, Eder et al. (2002) Cancer Gene Ther. 9:117-125). The transfection of
antisenses involves the same methods as for the transfection of iRNAs, making it
possible to envision the same applications in vivo for the iRNAs. With this in mind,
it is possible to imagine targeting the antisense or the iRNAs to the central
nervous system, in order to treat disorders of central origin (obesity), but also
those produced by a peripheral action of leptin receptors. More particularly, it is
possible to envision there being an action of the antisenses or of the iRNAs on the
transport of leptin across the blood-brain barrier, involving OB-R. Moreover,
endothelial cells have already been successfully targeted using an in vivo
antisense strategy (Bartsch et al. (2002) Pharm. Res. 19: 676-680).

Figures:

Figure 1

Sequences of the various antisense ODNs used.

Figure 2

Alignment of the OB-RGRP protein sequences of various species and of the
human MY047 protein sequence. The potential transmembrane domains were
determined by various methods (HMMTOP, TMHMM, TopPred2, TMpred) and are
written in bold.

Figure 3

Topology of OB-RGRP studied by BRET, using the double fusion protein YFP-
OB-RGRP-Luc. Figure 3a: diagrammatic representation of the topology of OB-
RGRP for the models 3 and 4TM. Figure 3b: results of the BRET experiments
using the proteins indicated. The data are expressed in mBU.

Figure 4 

Study of the oligomerization of OB-RGRP with SDS-PAGE experiments and
immunoprecipitations. Figure 4a: the cells expressing the fusion proteins indicated
were treated or not treated with 2 mmol.L⁻¹ dithiobis(succinimidyl propionate)
(DSP) in PBS (1X, pH 7.4) in order to crosslink the protein complexes. The
proteins were separated by SDS-PAGE and the proteins from fusions with YFP
were detected using a specific anti-YFP antibody. Figure 4b: the cells expressing
the construct 6Myc-OB-RGRP were solubilized with 1% of digitonin or 5% of SDS
and the solubilized material was immunoprecipitated with an anti-myc antibody.
The precipitates were subjected to separation by SDS-PAGE and the proteins
tagged with myc were detected with an anti-myc antibody.

Figure 5

Identification of the molecular determinants involved in the oligomerization of OB-
RGRP. The proteins from fusions with the OB-RGRP truncations were treated as
described in Fig. 4b. TM, transmembrane domain.

Figure 6 

Study of the oligomerization of OB-RGRP in live HEK cells, by BRET technology.
Figure 6a: the fusion proteins indicated were coexpressed at an equimolar ratio,
and BRET measurement experiments were carried out. Figure 6b: constant
amounts of the plasmid OB-RGRP-Luc were coexpressed with increasing
amounts of the plasmid OB-RGRP-YFP and BRET measurements were carried
out. MT2R-Luc, protein from fusion of the melatonin receptor MT2 with luciferase.

Figure 7

Interaction of OB-Rs and of OB-RGRP studied by BRET. The fusion proteins
indicated were coexpressed at an equimolar ratio and BRET measurements were
carried out. IR-YFP, protein from fusion of the insulin receptor with YFP.

Figure 8

Dose-dependent activation of the reporter genes for STAT3 (Figure 8a) and
STAT5 (Figure 8b) in HeLa cells, by OB-Rl in the presence of overexpression of
the OB-RGRP protein constructs as indicated.

Figure 9

Effect of the overexpression of OB-RGRP on the expression of OB-R at the
surface of the cells. HEK 293 cells transfected or not transfected with the OB-
RGRP expression vector, and COS cells transfected with the OB-Rl or OB-Rs
expression vectors and +/- the OB-RGRP vectors, were used to determine the
amount of receptors expressed at the surface and the total expressed in the cells,
by 125I-leptin-binding experiments.

Figure 10

Effect of the various antisense oligodeoxynucleotides (ODNs) on the level of OB-
RGRP messengers observed by semiquantitative RT-PCR. Figure 10a:
determination of the linear zone of amplification of the OB-RGRP and GAPDH
transcripts, as a function of the number of PCR cycles. Figure 10b: quantification
of the results shown in panel a. Figure 10c: determination of the relative levels of
expression of the OB-RGRP mRNAs at 26 PCR cycles, in the cells incubated with
the various antisense ODNs.

Figure 11

Effect of the OB-RGRP-specific antisense ODNs on the activation of a STAT3
reporter gene. The HeLa cells were cotransfected firstly with the OB-Rl expression
vector and the constructs of the reporter genes for STAT3 or 5, and then with the
antisense ODNs indicated. After 48 hours of stimulation or no stimulation with
10 nmol.L⁻¹ of leptin.

Figure 12

Effect of the OB-RGRP-specific antisense ODNs on the surface expression of OB-
R. The HeLa cells were transfected or not transfected with the OB-Rl or OB-Rs
expression plasmids, before a second transfection with the antisense ODNs
indicated, or no second transfection. 48 h post-transfection, the total amount of
OB-R and the fraction exposed at the surface were determined in binding
experiments with 125I-leptin.

The present invention is illustrated, without, however, being limited, by the
following examples.

Materials and methods used in the examples

Plasmid construction

The proteins from fusion of OB-R with YFP and luciferase were constructed
by ligation of YFP and of luciferase to the C-terminal portion of the OB-R
receptors, by standard molecular biology techniques. The coding region of YFP
was obtained from the vector Cytogem®-Topaze (pGFPtpz-N1) (Packard,
Meriden, CT) and was inserted into the EcoRV site of the vector pcDNA3/CMV
(Invitrogen, Groningen, The Netherlands) containing a modified polylinker. The
coding region of Renilla luciferase was obtained from the vector pRL-CMV
(Promega, Madison, WI) and inserted into the EcoRV site of the modified vector
pcDNA3. The coding regions of OB-Rl and of OB-Rs (a gift from Dr. Gainsford,
Royal Melbourne Hospital, Victoria, Australia) were inserted into the two vectors
described above, respectively into the EcoR1/BamH1 and Nhe1 sites. The stop
codons were deleted by site-directed mutagenesis and the frame of the fusion
proteins was adjusted at the same time.

The vector pcDNA3-OB-RGRP was obtained by insertion of the coding
region of OB-RGRP, obtained from the vector pCDNA3-Di1, into the EcoR1 and
Xba1 sites of the vector pcDNA3/CMV (Invitrogen, Groningen, The Netherlands).
The stop codon of OB-RGRP was deleted by site-directed mutagenesis. The
vector pcDNA3-OB-RGRP-Luc was obtained by digestion of the vector pRL-CMV
N3 (Promega, Madison, WI) with Sma1 and Hpa1 and by insertion of the fragment
corresponding to the coding region of Renilla luciferase, after the coding region of
OB-RGRP, into the filled-in BspE1 site of the vector pcDNA3-OB-RGRP.

The vector pcDNA3-YFP was obtained by subcloning the coding region of
YFP from the vector pGFPtpz-N1 (Packard, Meriden, CT) inserted into the EcoRV
site of the vector pcDNA3/CMV. The vector pcDNA3-OB-RGRP-YFP was
obtained by insertion of the BamH1/BspE1 fragment of the vector
pCDNA3-OB-RGRP non-stop into the vector pcDNA3-YFP digested with the
BamH1 and Age1 enzymes.

The construct pcDNA3-GFP-OB-RGRP-Luc was obtained by insertion of
the OB-RGRP-Luc fragment of the vector pcDNA3-OB-RGRP-Rluc, cleaved with
EcoR1, into the EcoR1 site of the vector pcDNA3-YFP. The stop codon of the
YFP was removed by site-directed mutagenesis.

The vector 6Myc-OB-RGRP (4TM) was obtained by insertion of the 6myc
fragment of the vector pCDNA3-RSV-6Myc into the BamH1 and EcoR1 sites of
the vector pCDNA3-OB-RGRP. The various OB-RGRP deletions (2 and 3 TM)
were obtained by PCR and insertion into the EcoR1 and Xba1 sites of the
vector pcDNA3. The coding sequence of MY047 was obtained by RT-PCR on
mRNAs of human origin. The PCR fragment was digested with the EcoR1/Xba1
restriction enzymes and inserted into the vector pcDNA3-Topaze cleaved with the
same enzymes. The stop codon of the YFP was then removed by site-directed
mutagenesis, so as to obtain the vector pcDNA3-YFP-MY047. The vector
pcDNA3-MY047-GFP was obtained by insertion of the DNA fragment obtained by
PCR on the vector pcDNA3-YFP-MY047 and cleaved with BamH1, then inserted
into the vector pcDNA3-YFP cleaved with the same enzyme. Insertion of the same
fragment into the vector pcDNA3-Rluc cleaved with BamH1 made it possible to
obtain the vector pcDNA3-MY047-Rluc.

All the constructs were verified by sequencing.

Cell culture and transfection

The HEK 293, COS-7 and HeLa cells were cultured in DMEM
supplemented with 10% (v/v) of fetal calf serum (FCS), 4.5 g/liter of glucose,
100 U/ml of penicillin, 0.1 mg/ml of streptomycin and 1 mmol.L⁻¹ of glutamine
(all from Life Technologies, Gaithersburg, MD). The transient transfections were
carried out with the FuGene 6 reagent (Roche, Basel, Switzerland) according to
the supplier's instructions.

Preparation of membranes and solubilization

The membranes were prepared as previously described (19), and
resuspended in 75 mmol.L⁻¹ Tris (pH 7.4), 12.5 mmol.L⁻¹ MgCl2 and 5 mmol.L⁻¹
EDTA, and immediately used in BRET experiments.

SDS-PAGE and Western blotting

The total lysates were prepared by washing the cells once with cold PBS
(pH 7.4) and denatured by adding loading buffer (30 mmol.L⁻¹ Tris-HCl, pH 6.8,
1% glycerol, 5% SDS, 50 mmol.L⁻¹ DTT and 0.05% bromophenol blue). The total
lysates or the immunoprecipitates were incubated for 10 minutes at 90°C and then
loaded onto 10% acrylamide gel for separation by electrophoresis (SDS-PAGE).
The proteins were then transferred onto a nitrocellulose membrane and revealed
with specific primary antibodies: anti-YFP (8367-1 Living Colors) diluted to 1/200,
anti-myc A14 (sc-789 TEBU Peprotech Santa Cruz Biotechnology) diluted to
1/500, then a secondary antibody coupled to peroxidase (goat anti-rabbit IgG;
Jackson Immunoresearch Laboratories, Inc., West Baltimore Pike) diluted to
1/10,000. The immunoreactive bands were revealed with an ECL kit (Pharmacia
Biotech).

Immunoprecipitation

Two days after transfection, the cells were washed once with cold PBS,
and the proteins were extracted by incubation for 15 minutes in lysis buffer (1X
PBS, 1% Nonidet P40, 0.5% sodium deoxycholate, 0.1% SDS, 0.02% NaN3,
10 mg.L⁻¹ benzamidine and 5 mg.L⁻¹ trypsin inhibitors). The lysate was centrifuged
at 18000 g for 15 min and the supernatant was then incubated for 3 hours at 4°C
with an anti-myc antibody coupled to agarose beads (sc 40AC TEBU Peprotech,
Santa Cruz Biotechnology). The precipitates were washed three times with cold
lysis buffer and denatured with loading buffer for SDS-PAGE.

Radiobinding experiments

The radiobinding experiments were carried out as previously described (Barr et
al.), with slight modifications. To determine the surface leptin binding, the cells
cultured in the 6-well plates were washed twice with cold PBS and incubated in
the binding buffer (DMEM, 25 mmol.L⁻¹ Hepes, pH 7.4, 1% BSA) containing
100,000 cpm/well of 125I-leptin (PerkinElmer Life Sciences, Paris, France) in the
presence or absence of 200 nmol.L⁻¹ of leptin (PeproTech Inc, USA) for 4 h at
4°C. The cells were washed twice with cold PBS, then lysed in 1N NaOH, and the
radioactivity was determined in a γ counter. To determine the total binding of
leptin, the cells cultured in dishes 10 cm in diameter were solubilized in 1.5 ml of
binding buffer containing 0.15% of digitonin, for 2 h at 4°C. The extracts were
centrifuged for 30 min at maximum speed and at 4°C. The supernatants (0.2 ml)
were incubated with 100,000 cpm of 125I-leptin in the presence or absence of 200
nmol.L⁻¹ of leptin, in a total volume of 0.25 ml, with constant rotation at 4°C
overnight. 0.5 ml of γ-globulin (1.25 mg/ml) and 0.5 ml of polyethylene glycol 6000
(25% w/v) were added in order to precipitate the receptor-ligand complexes, which
were centrifuged at 17,000 g for 3 min. The pellet was washed once with 1 ml of
polyethylene glycol 6000 (12% w/v) and the radioactivity was determined in a
γ counter.

Reporter gene activation assay 

The HeLa cells cultured in wells of 6-well plates were cotransfected with
500 ng of a reporter plasmid expressing firefly luciferase under the control of
STAT3 or STAT5 factor response elements (a gift from Dr. Levy, University of
New York, New York, USA), 250 pg of the expression vector pcDNA3-Renilla
luciferase (used as internal standard between the samples) and with 500 ng of the
various OB-R expression vectors or the vector alone. 48 h after transfection, the
cells were starved overnight in Optimem medium (Invitrogen, Groningen, The
Netherlands) containing 1% of BSA, before stimulation with 10 nmol.L-1 of leptin,
or no stimulation, for 48 h. The cells were then washed once with PBS, then lysed
in passive lysis buffer (Promega Corporation, Madison, WI) for 15 min at ambient
temperature. The total lysates were centrifuged for 2 min at 15,000 g and the
supernatants were used in an assay to measure luciferase (Dual Luciferase Assay
System from Promega Corporation, Madison, WI) using a Berthold luminometer
(Lumat LB 9507). The results are expressed as the ratio of firefly luciferase activity
to Renilla luciferase activity.
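
The normalization described above can be written out as a short sketch. The firefly/Renilla ratio is taken from the text; the fold-induction helper and the luminometer readings are illustrative assumptions.

```python
def normalized_activity(firefly_rlu, renilla_rlu):
    """Firefly luciferase activity divided by the Renilla internal standard."""
    if renilla_rlu <= 0:
        raise ValueError("Renilla signal must be positive")
    return firefly_rlu / renilla_rlu


def fold_induction(stimulated_ratio, basal_ratio):
    """Leptin-stimulated ratio relative to the unstimulated ratio."""
    if basal_ratio <= 0:
        raise ValueError("basal ratio must be positive")
    return stimulated_ratio / basal_ratio


# Hypothetical relative light units, for illustration only.
basal = normalized_activity(4000.0, 2000.0)
stimulated = normalized_activity(6600.0, 2200.0)
print(fold_induction(stimulated, basal))
```

Dividing by the Renilla signal corrects for well-to-well differences in transfection efficiency and cell number before the two conditions are compared.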

BRET measurements in microplates

48 h after transfection, the COS-7, HeLa or HEK 293 cells expressing the
OB-R fusion proteins were detached and washed in PBS. 1-2 x 10^5 cells were
distributed into wells of Optiplate plates (96-well, Packard Instrument Company,
Meriden, CT) in the presence or absence of the ligands, and incubated at 25°C.
Alternatively, the same procedure was carried out with membranes prepared from
the cells expressing the various constructs. The substrate, coelenterazine h
(Molecular Probes, Eugene, OR), was added at a final concentration of 5 µmol.L-1
and the readings were carried out with a Fusion™ luminometer/fluorimeter
(Packard Instrument Company, Meriden, CT), which makes it possible to measure
luminescence through two filters (luciferase filter: 485 ± 10 nm; YFP filter: 530 ±
12.5 nm). The BRET ratio was defined as the difference between the emission
ratio 530 nm/485 nm of the cells cotransfected with the Luc and YFP fusion
proteins and the emission ratio 530 nm/485 nm of the cells transfected with the
Luc fusion protein alone. The results are expressed in milliBRET units (mBU),
1 mBU corresponding to this difference in ratios multiplied by 1000.
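
The BRET ratio defined above can be sketched in a few lines. The formula follows the definition in the text; the function names and the readings are hypothetical.

```python
def emission_ratio(em_530, em_485):
    """Ratio of light measured through the YFP filter to the luciferase filter."""
    if em_485 <= 0:
        raise ValueError("485 nm emission must be positive")
    return em_530 / em_485


def bret_mbu(coexpr_530, coexpr_485, donor_only_530, donor_only_485):
    """BRET signal in milliBRET units (mBU).

    Difference between the 530/485 ratio of cells coexpressing the Luc and
    YFP fusions and that of cells expressing the Luc fusion alone, times 1000.
    """
    difference = (emission_ratio(coexpr_530, coexpr_485)
                  - emission_ratio(donor_only_530, donor_only_485))
    return 1000.0 * difference


# Hypothetical luminescence counts, for illustration only.
print(bret_mbu(1500.0, 2000.0, 1000.0, 2000.0))
```

Subtracting the donor-only ratio removes the part of the luciferase emission that passes through the YFP filter without any energy transfer.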

RT-PCR 

The total RNAs were extracted by the method of Chomczynski and Sacchi
(Chomczynski P., and Sacchi N. (1987) Anal. Biochem. 162, 156-159). 1 µg of
RNA is denatured for 5 minutes at 68°C and then abruptly cooled for 5 min at 4°C.
The denatured sample is reverse transcribed for 1 h at 37°C in 20 µl of RT
reaction medium (5 µmol.L-1 PdN6, 10 µmol.L-1 DTT, 50 mmol.L-1 Tris-HCl,
pH 8.3, 75 mmol.L-1 KCl, 5 mmol.L-1 MgCl2, 500 µmol.L-1 dNTP, 200 U
RT-MMLV). A 2.5 µl aliquot of this reaction is used for a PCR reaction in a final
volume of 25 µl (40 mmol.L-1 Tris-HCl, pH 8.4; 100 mmol.L-1 KCl; 1.5 mmol.L-1
MgCl2; 0.2 mmol.L-1 of each dNTP; 0.141 mmol.L-1 of primers specific for
OB-RGRP (sense: CCGTGGCAGGAAGC, antisense: CAGCCACACGAGCAAG);
0.035 mmol.L-1 of primers specific for glyceraldehyde phosphate dehydrogenase
(GAPDH) (sense: GGAGAAGGCTGGGGC, antisense:
GATGGCATGGACTGTGG); and 2.5 U of Taq DNA polymerase). The following
protocol was used for the PCR reaction: initial denaturation for 3 min at 94°C, then
22 to 30 cycles of denaturation (20 sec at 94°C), hybridization (20 sec at 59°C) and
elongation (20 sec at 72°C), followed by a final elongation of 7 min at 72°C.

An aliquot of the PCR reaction was loaded onto a 2% agarose gel in order
to separate the reaction products by electrophoresis. The expected sizes of
fragments of GAPDH and of OB-RGRP are, respectively, 229 bp and 334 bp.
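
As a small worked example, the hold times of the cycling protocol above can be summed as follows. The step durations come from the text; the function name is hypothetical, and ramp times between temperatures are ignored, so a real run takes longer.

```python
def pcr_hold_time_seconds(cycles,
                          initial_denaturation_s=180,
                          cycle_steps_s=(20, 20, 20),
                          final_elongation_s=420):
    """Sum of the programmed hold times for the protocol in the text.

    cycle_steps_s holds denaturation, hybridization and elongation times.
    Ramp times are not included (an assumption of this sketch).
    """
    if not 22 <= cycles <= 30:
        raise ValueError("the text specifies 22 to 30 cycles")
    return initial_denaturation_s + cycles * sum(cycle_steps_s) + final_elongation_s


for n in (22, 30):
    print(n, "cycles:", pcr_hold_time_seconds(n) / 60, "min")
```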

Oligonucleotide synthesis

The oligonucleotides were synthesized on an automatic DNA synthesizer
("Expedite MOSS" 8909 model from Applied Biosystems) by standard
phosphoramidite chemistry and iodine oxidation. The demethylation was carried
out with a 0.2 mol.L-1 solution of 3H-1,2-benzodithiol-3-one 1,1-dioxide in
acetonitrile for 120 s. The detachment from the support and the deprotection were
carried out in concentrated ammonia (18 h at 55°C), and the oligonucleotides
were then purified by precipitation. The deprotection product was precipitated with
10 volumes of 1-butanol; the pellet, taken up in one volume of 0.3 mol.L-1 NaCl,
was reprecipitated by adding 4 volumes of ethanol.
The analysis on a 20% polyacrylamide gel (in a buffer of 8 mol.L-1 urea and
454 mmol.L-1 Tris-borate, at pH 7.0) showed a greater than 80% proportion of
product of expected length.


Transfection of the antisense oligodeoxynucleotides 

For the transfection of 300,000 cells cultured in a well of a 6-well plate, 10 µl of
antisense ODN at 20 µmol.L-1 were diluted in 175 µl of DMEM. 3 µl of
Oligofectamine (Invitrogen, Groningen, The Netherlands) and 12 µl of DMEM were
incubated in a second tube for 10 min at ambient temperature. The
Oligofectamine/DMEM mixture was then added to the diluted antisense ODN,
vortexed and incubated for 20 min at ambient temperature. During this time, the
cells were washed once with PBS and once with DMEM, and then covered with
800 µl of DMEM. The ODN/Oligofectamine mixture was then added dropwise to
the cells and incubated for 4 h at 37°C, before adding 500 µl of DMEM
supplemented with 30% serum.
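
The final antisense concentration on the cells follows from the volumes above. The sketch below is a worked arithmetic example under two assumptions not stated in the text: volumes are additive, and no wash medium remains in the well before the 800 µl of DMEM are added. The function name is hypothetical.

```python
def odn_concentration_nmol_per_l(odn_ul, odn_stock_umol_per_l, other_volumes_ul):
    """Final ODN concentration in nmol.L-1.

    µl multiplied by µmol.L-1 gives picomoles; picomoles per µl is µmol.L-1,
    hence the factor 1000 to express the result in nmol.L-1.
    """
    amount_pmol = odn_ul * odn_stock_umol_per_l
    total_ul = odn_ul + sum(other_volumes_ul)
    return 1000.0 * amount_pmol / total_ul


# During the 4 h incubation: 10 + 175 + 3 + 12 + 800 = 1000 µl.
during_incubation = odn_concentration_nmol_per_l(10, 20, [175, 3, 12, 800])
# After adding 500 µl of serum-containing DMEM.
after_serum = odn_concentration_nmol_per_l(10, 20, [175, 3, 12, 800, 500])
print(during_incubation, round(after_serum, 1))
```

Under these assumptions the cells see about 200 nmol.L-1 of antisense during the incubation and about 133 nmol.L-1 once the serum-containing medium has been added.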

Example 1: Topology and cellular location of OB-RGRP

To study the topology and the subcellular location of OB-RGRP, the protein
was tagged with the yellow variant of green fluorescent protein (YFP) at the end of
its C-terminal tail. The fusion protein was expressed in HeLa cells and its location
was determined by fluorescence microscopy.

The results show that the fusion protein is preferentially targeted to the
perinuclear membranes and into intracellular vesicles. Similar results were
observed in HEK cells. No colocalization with cytoplasmic and nuclear proteins
was observed, confirming the location of OB-RGRP in membranes (not shown).
The exact nature of the membrane compartment was determined by colocalization
studies with markers specific for subcellular compartments. A strong colocalization
was observed with the invariant chain of MHC II molecules, a marker for the
endocytic compartment.

Initial analysis of the topology of OB-RGRP suggested an organization in 3
transmembrane (TM) domains (14). A similar organization has been proposed for
MY047 (16). However, a new analysis of the hydrophobicity profile of the various
protein sequences available for OB-RGRP and MY047 is also compatible with a
4-TM model (Fig. 2). The topology differs profoundly between these two models.
In the 3-TM model, the N- and C-terminal ends are located on opposite sides of the
membrane, whereas in the 4-TM model, the two tails are oriented on the same
side of the membrane (Fig. 3a). To determine the correct model, we used the
bioluminescence resonance energy transfer (BRET) method, which has recently
been developed to follow protein-protein interactions in living cells (Xu et al. (1999)
Proc Natl Acad Sci USA 96, 151-156). In the event of physical proximity (< 100 Å)
between the two interacting proteins, an energy transfer can take place between the
energy donor (Luc) and the energy acceptor (YFP), fused to the two proteins of
interest. We tagged the N-terminal tail of OB-RGRP with YFP, and the C-terminal
tail with luciferase, and we observed the energy transfer by measuring BRET with
this double fusion protein. The 3-TM model does not allow transfer since the two
BRET partners are separated by the lipid bilayer. On the other hand, the 4-TM
model predicts strong energy transfer since the two partners are located on the
same side of the membrane. As shown in Figure 3b, a very strong energy transfer
was detected for the double fusion protein in the intact cells, indicating that
OB-RGRP has 4 TMs.

This set of results suggests that OB-RGRP is a membrane-bound protein
with 4 transmembrane domains, having 3 short loops and short N- and C-terminal
ends oriented on the same side of the membrane. OB-RGRP is mainly located in
intracellular compartments.

Example 2: Oligomerization of OB-RGRP 

Oligomerization is a property common to various proteins, including
membrane-bound proteins such as tyrosine kinase receptors, cytokine receptors
and phosphotyrosine phosphatases. It has been shown that this oligomerization
plays an important role in the function of these proteins. To gain insight into the
function of OB-RGRP, we wanted to know whether this protein oligomerizes.

OB-RGRP was tagged with YFP at its C-terminal tail and expressed in
HeLa cells. The proteins were separated by polyacrylamide gel electrophoresis
under denaturing conditions (SDS-PAGE) and immunoblotting experiments were
carried out with an anti-YFP antibody. Figure 4a reveals several bands specific for
OB-RGRP-YFP, corresponding to monomeric and dimeric forms and oligomeric
complexes. Similar results were obtained with OB-RGRP tagged at the
N-terminal, either with YFP or a myc epitope (Fig. 4a,b). Formation of the
OB-RGRP oligomers was observed on total cell extracts after immunoprecipitation.
The use of a crosslinker on whole cells stabilizes the dimeric complexes,
indicating that the dimeric form is the predominant form of OB-RGRP in intact
cells (Fig. 4a).

Surprisingly, the OB-RGRP oligomers are
stable in the presence of various denaturing and/or dissociating agents such as
5% SDS, 1% Triton X-100, 1% Nonidet P40, 1% digitonin, 50 mmol.L-1 DTT and
2% β-mercaptoethanol. However, similar observations were obtained for other
membrane-bound proteins such as glycophorin A and G protein-coupled
β2-adrenergic receptors. Studies on these proteins show, respectively, that
LIXXGVXXG and LXXXGXXXGXXXL motifs in the transmembrane domains are
essential for oligomer formation. Similar motifs were identified in the membrane
regions of OB-RGRP.

To identify the molecular determinants involved in the dimerization, we
prepared OB-RGRP constructs exhibiting progressive deletions of the C-terminal
tail (Fig. 5). A construct containing only the first two potential TMs loses the ability
to form oligomers. Addition of the 3rd TM restores the ability to form dimers.
However, the complete oligomerization profile was only observed in the presence
of all 4 potential TMs.

Oligomers of membrane-bound proteins can be artifacts induced during the
preparation of samples (solubilization, denaturation, etc.). For this reason, it is
important to verify the oligomerization of proteins in living cells. Recently
developed energy transfer techniques such as BRET make it possible to follow
such protein-protein interactions in living cells. Fusion proteins of OB-RGRP with
luciferase and YFP were used to follow OB-RGRP oligomerization in living cells.
Coexpression of the OB-RGRP-YFP or YFP-OB-RGRP constructs with the
OB-RGRP-Luc construct induces an energy transfer (Fig. 6a). The specificity of this
interaction was shown by the lack of energy transfer during the coexpression with
two different fusion proteins: β-arrestin2-YFP (Angers et al. (2000) Proc Natl
Acad Sci USA 97, 3684-3689), or MT2 melatonin receptor-Luc (Ayoub et al.
(2002) J Biol Chem 277, 21522-21528). We then expressed various ratios of the
BRET partners (Fig. 6b). The BRET signal increases in a hyperbolic manner as
a function of the OB-RGRP-YFP/OB-RGRP-Luc ratio, reaching an asymptote
which corresponds to saturation of the energy-donor molecules (OB-RGRP-Luc)
by the acceptor molecules (OB-RGRP-YFP), which is expected in the case of a
specific interaction.
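
The hyperbolic saturation described above can be illustrated with a one-site saturation curve. This is a sketch only: the functional form is a common way to describe such data and is an assumption here, and the parameter names (bret_max, bret_50) and values are hypothetical rather than taken from Figure 6b.

```python
def bret_saturation(acceptor_donor_ratio, bret_max, bret_50):
    """Hyperbolic BRET signal as a function of the YFP/Luc expression ratio.

    bret_max is the asymptote reached when all donors are paired with an
    acceptor; bret_50 is the ratio that gives half of bret_max.
    """
    if acceptor_donor_ratio < 0:
        raise ValueError("ratio must be non-negative")
    return bret_max * acceptor_donor_ratio / (bret_50 + acceptor_donor_ratio)


# Hypothetical parameters, for illustration only.
for ratio in (0.0, 0.5, 2.0, 10.0, 50.0):
    print(ratio, round(bret_saturation(ratio, bret_max=200.0, bret_50=0.5), 1))
```

The signal rises steeply at low ratios and then flattens toward bret_max, which is the behavior the text reads as evidence of a specific interaction.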

Collectively, these results show that OB-RGRP is a dimeric membrane-bound
protein which can also be involved in high molecular weight oligomeric
complexes. The 3rd and 4th potential transmembrane domains appear to be
important for oligomer formation.

Example 3: Interaction between OB-R and OB-RGRP

We used BRET technology to study a possible interaction between OB-R
and OB-RGRP in living cells. An energy transfer was constitutively observed in
the cells coexpressing the OB-Rs-Luc construct and the OB-RGRP-YFP construct,
indicating proximity of the interaction partners (Fig. 7). The same results were
obtained in cells coexpressing OB-Rs-Luc and the MY047-YFP construct, and
also in the reverse orientation: in cells coexpressing OB-RGRP-Luc and
OB-Rs-YFP, or in cells coexpressing MY047-Luc and OB-Rs-YFP. The specificity of
these interactions was confirmed by the lack of energy transfer between
OB-Rs-Luc, OB-RGRP-Luc or MY047-Luc and a construct of the insulin receptor tagged
with YFP (Boute et al. (2001) Mol Pharmacol 60, 640-645), and also in the reverse
orientation: by the lack of energy transfer between a construct of the insulin
receptor tagged with Luc and the OB-Rs-YFP, OB-RGRP-YFP and MY047-YFP
constructs. Coexpression of OB-Rs-Luc and an OB-RGRP or MY047 construct
exhibiting the YFP tag at the N-terminal produces no significant signal, confirming
the specificity of interaction with OB-RGRP-YFP and MY047-YFP, and indicating
that the N-terminal end of OB-RGRP and MY047 must be involved in the interaction
with OB-R.

No significant energy transfer was observed in the cells coexpressing the
OB-Rl-Luc and OB-RGRP-YFP or YFP-OB-RGRP constructs. This is not due to a
lack of functional OB-Rl-Luc expression since a specific BRET signal was
observed in cells coexpressing it with OB-Rl-YFP in order to follow OB-R dimerization.
The lack of BRET between the OB-Rl-Luc and OB-RGRP-YFP fusion proteins
does not exclude a direct interaction between these two proteins since this may
be explained by the fact that the distance between the two BRET partners (Luc
and YFP) is greater than 100 Å, the maximum distance for obtaining a transfer.
This should be the case since the N- and C-terminal ends of OB-RGRP should be
located close to the transmembrane region of OB-R, whereas the C-terminal end
of OB-Rl should more probably point toward the cytoplasm due to its long
intracellular tail of approximately 300 amino acids. Given that the short and long
isoforms of OB-R share the same trans- and juxtamembrane regions and that the
interaction of OB-RGRP with OB-Rs is located at this level, it is probable that
OB-RGRP interacts with OB-Rl in the same way as with OB-Rs.

Example 4: Effect of the overexpression of OB-RGRP on OB-R signaling

Constructs containing STAT3- or STAT5-response elements upstream of a
luciferase reporter gene were coexpressed with OB-Rl in the presence or absence
of various OB-RGRP constructs (Fig. 8). The two constructs were activated by
leptin in a dose-dependent manner, with an EC50 of approximately 50 pM. Similar
results were obtained in HEK 293 cells stably expressing a reporter gene for
STAT3. The overexpression of various OB-RGRP constructs had no reproducible
effect on this activation, indicating that OB-RGRP is not a limiting factor.
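
A dose-dependent activation with an EC50 of about 50 pM can be pictured with a standard Hill-type curve. Only the EC50 comes from the text; the Hill coefficient of 1, the basal and maximal values, and the function name are assumptions made for illustration.

```python
def reporter_response(leptin_pm, ec50_pm=50.0, basal=1.0, maximal=1.5, hill=1.0):
    """Reporter activity (arbitrary units) as a function of leptin in pmol.L-1.

    ec50_pm is taken from the text; basal, maximal and hill are hypothetical.
    """
    if leptin_pm < 0:
        raise ValueError("concentration must be non-negative")
    occupancy = leptin_pm ** hill / (ec50_pm ** hill + leptin_pm ** hill)
    return basal + (maximal - basal) * occupancy


# 10 nmol.L-1 (10,000 pM), the stimulation dose used in the reporter assay,
# lies far above the EC50, so the response is close to maximal.
for dose_pm in (0.0, 5.0, 50.0, 500.0, 10000.0):
    print(dose_pm, round(reporter_response(dose_pm), 3))
```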


Example 5: Effect of the overexpression of OB-RGRP on the expression of 
OB-R at the surface 

In yeast knocked out for OB-RGRP (Vps55), protein transport is disturbed
between the Golgi and the vacuoles (Belgareh-Touze et al. (2002) Molecular
Biology Of The Cell 13, 1694-1708). Although OB-R are activated only when they
are expressed at the plasma membrane, a considerable amount of receptors
accumulates in intracellular compartments (Barr et al. (1999) J Biol Chem 274,
21416-21424) (Lundin et al. (2000) Biochimica et Biophysica Acta 1499,
130-138). For this reason, we tested the effects of the overexpression of OB-RGRP
on the expression of OB-R at the cell surface.

The receptor distribution was studied by 125I-leptin-binding experiments. In
agreement with other authors (Barr et al., 1999), we showed that only 10-20% of
the OB-Rl and OB-Rs receptors are expressed at the surface of transfected COS
cells (Fig. 9) and of HeLa cells. This is not an artifact due to the expression of
exogenous receptors since similar values are obtained in HEK 293 cells
expressing endogenous OB-R receptors (Fig. 9). The overexpression of
OB-RGRP modified neither the total amount of receptors in the cells nor the
percentage of receptors expressed at the surface (Fig. 9).


Example 6: Characterization of OB-RGRP-specific antisense
oligodeoxynucleotides

OB-RGRP appears to have ubiquitous expression; for this reason, the
decrease in expression of this protein was chosen as an alternative approach for
studying its role in OB-R function. Fourteen antisenses specific for OB-RGRP (AS
1 to 14) and two random antisenses (AS 15 and 16) were chosen (see Figure 1),
synthesized, and then tested for their ability to inhibit OB-RGRP expression using
semiquantitative RT-PCR experiments in HeLa cells expressing OB-RGRP
endogenously (Fig. 10). Only one of these antisenses (AS-14), derived from the
untranslated 3' region of the OB-RGRP mRNA, interferes with OB-RGRP
expression. Labeling of this antisense with the Cy3 fluorophore made it possible to
show that all of the cells were transfected under our experimental conditions, in
our various experiments.
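
The relationship between AS-14 and its target can be checked with a short sketch. It takes the AS14 sequence (SEQ ID No. 2) and the first 120 nucleotides of SEQ ID No. 1 as given in the sequence listing, and looks for the reverse complement of the antisense in that fragment. The variable and function names are illustrative.

```python
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq):
    """Reverse complement of a lowercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# SEQ ID No. 2 (antisense AS14), from the sequence listing.
AS14 = "aatgccgcatgtgcacatgt"

# First 120 nucleotides of SEQ ID No. 1, from the sequence listing.
TARGET_FRAGMENT = (
    "cactttattctgattacagtgcattgaatttcttagaactcatactatctgtatacatgt"
    "gcacatgcggcattttactatgaaatttaatatgctgggttttttaatacctttatatat"
)

site = reverse_complement(AS14)
start = TARGET_FRAGMENT.find(site)  # 0-based index, -1 if absent
if start >= 0:
    print("target site:", site)
    print("positions (1-based):", start + 1, "to", start + len(site))
else:
    print("no perfectly complementary site in this fragment")
```

A perfect match means that the antisense can pair with every base of a 20-nucleotide stretch of the target sequence.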

Example 7: Effect of the OB-RGRP-specific antisense on the signaling and
surface expression of the OB-R

HeLa cells were first cotransfected with the expression vectors for OB-Rl
and the reporter gene for STAT3, and then with the antisenses. Leptin causes an
approximately 1.5-fold increase in the basal activation of the reporter gene for
STAT3 in the control cells without antisense, or with a control antisense (AS16)
(Fig. 11). In the cells transfected with the antisense specific for OB-RGRP
(AS-14), the basal and leptin-stimulated signaling is increased relative to
the control conditions. This shows that activation of the JAK/STAT pathway is
increased in the cells exhibiting a decrease in OB-RGRP expression. These
observations may be explained by an inhibitory effect of OB-RGRP on the basal
and OB-R-stimulated activity and, in this case, OB-RGRP can be considered to be
a regulator of OB-R signaling. Alternatively, OB-RGRP might regulate
the expression of the surface receptors by limiting the number of OB-R reaching
the cell surface. This is in agreement with the fact that only 10 to 20% of the
receptors expressed reach the cell surface. In this hypothesis, the decrease in
OB-RGRP expression should increase the number of receptors at the cell surface,
which should increase the signaling by these receptors. To test this hypothesis,
we quantified the number of OB-Rl and OB-Rs receptors expressed at the cell
surface in the presence (control) and absence (AS-14) of OB-RGRP (Fig. 12).
Transfection of the random antisense showed no effect on the number of
receptors expressed at the cell surface, whereas that of the specific antisense
(AS-14) caused a 3-fold increase in the number of OB-R expressed at the plasma
membrane. Similar results were obtained in nontransfected HeLa cells expressing
endogenous receptors. Under these experimental conditions, the total number of
receptors, measured by 125I-leptin-binding experiments, showed no significant
variations.

All our results are consistent with the role of OB-RGRP in yeast, in protein
transport. The increase in surface expression of OB-R appears to be involved in
the increase in signaling observed. However, we cannot entirely exclude the
hypothesis that OB-RGRP directly regulates OB-R activity. The application of
specific antisenses directed against OB-RGRP should be useful for increasing
OB-R signaling in leptin-related disorders, such as human obesity, in which
resistance to leptin is observed, characterized by an inadequate response to this
hormone. The increase in expression of the receptors at the cell surface and in
their signaling should be important for increasing the response to leptin in the
case of human obesity, firstly by increasing leptin transport to the brain across the
blood-brain barrier and, secondly, by increasing OB-R signaling in the
hypothalamus.

The interaction between OB-RGRP and OB-Rs implies that the action of
OB-RGRP takes place via this direct interaction with the receptors and that
preventing this interaction may reproduce the effects of the specific antisense
ODN. We propose using the BRET test of the interactions between
OB-RGRP and OB-Rs, and MY047 and OB-Rs, described above, as a test for
screening molecules which may modulate this interaction. This test may be
carried out either on whole or permeabilized cells coexpressing the fusion
proteins of the OB-RGRP and OB-Rs, or MY047 and OB-Rs, BRET partners, or on
membrane fractions derived from these cells.

CLAIMS

1. An optionally modified oligonucleotide comprising from 8 to 50 
nucleotides which hybridizes specifically to the sequence SEQ ID No. 
1 and which inhibits OB-RGRP expression. 

2. The oligonucleotide as claimed in claim 1, which promotes the 
expression of leptin receptors at the cell surface. 

3. The oligonucleotide as claimed in either of claims 1 and 2, which is an 
antisense oligonucleotide. 

4. The oligonucleotide as claimed in one of claims 1 to 3, which 
comprises a sequence exhibiting at least 60% identity with the 
sequence SEQ ID No. 2. 

5. The oligonucleotide as claimed in one of claims 1 to 3, wherein 
nucleotides are thioesterified. 

6. The oligonucleotide as claimed in one of claims 1 to 3, wherein 
nucleotides are 2'-O-methylated.

7. The oligonucleotide as claimed in one of claims 1 to 3, which has a 
triethylene glycol residue at its 3' end. 

8. The oligonucleotide as claimed in one of claims 1 to 3, which is single- 
stranded. 

9. The oligonucleotide as claimed in one of claims 1 to 8, which 
comprises a sequence exhibiting at least 60% identity with the 
sequence SEQ ID No. 2, in which the nucleotides at positions 2, 4, 6, 
7, 9, 11, 13, 15, 17, 19 and 20, in the 5' to 3' direction, are 
thioesterified. 

10. The oligonucleotide as claimed in one of claims 1 to 8, which 
comprises a sequence exhibiting at least 60% identity with the 
sequence SEQ ID No. 2, in which the nucleotides at positions 1, 2, 3, 
4, 5, 16, 17, 18, 19 and 20, in the 5' to 3' direction, are
2'-O-methylated.

11. The oligonucleotide as claimed in one of claims 1 to 10, which is a 
DNA. 

12. An oligonucleotide of the iRNA type comprising from 15 to 25 
nucleotides, which hybridizes specifically to the sequence SEQ ID No. 
21 and which inhibits the expression of OB-RGRP. 

13. The oligonucleotide as claimed in claim 12, which is a double-stranded 
RNA. 

14. A vector expressing an oligonucleotide as claimed in one of claims 1 to
4 and 12.

15. A cell containing a vector as claimed in either of claims 13 and 14.

16. A medicinal product containing an oligonucleotide, a vector or a cell as
claimed in one of claims 1 to 15.

17. A pharmaceutical composition containing a pharmacologically active 
amount of an oligonucleotide, of a vector or of a cell as claimed in one 
of claims 1 to 15 and pharmaceutically acceptable excipients. 

18. The use of an oligonucleotide, of a vector or of a cell as claimed in one of
claims 1 to 15, for producing a medicinal product for preventing and/or 
treating leptin-related pathological conditions. 

19. A fusion protein, which is composed of a sequence exhibiting at least 
65% identity with the sequence SEQ ID No. 4, or the sequence SEQ ID
No. 16, or of a substantial part of the sequence SEQ ID No. 4 or of the
sequence SEQ ID No. 16, and of an energy-donor or energy-acceptor 
protein, or of a substantial and active part of an energy-donor or 
energy-acceptor protein. 

20. The fusion protein as claimed in claim 19, wherein the protein is a 
luciferase. 

21. The fusion protein as claimed in claim 19, wherein the protein is GFP 
or a mutant of this protein or DsRed. 

22. The fusion protein as claimed in claim 19, wherein the mutant of GFP 
is YFP, EYFP, wild-type GFP, GFPS65T or Topaz. 

23. The fusion protein as claimed in claim 19, which has the sequence 
SEQ ID No. 6, SEQ ID No. 8, SEQ ID No. 18 or SEQ ID No. 20. 

24. A nucleic acid encoding one of the proteins as claimed in one of claims 
19 to 23. 

25. The nucleic acid as claimed in claim 24, which has the sequence 
SEQ ID No. 5, SEQ ID No. 7, SEQ ID No. 17 or SEQ ID No. 19. 

26. A nucleic acid which exhibits at least 65% identity with the sequence as 
claimed in claim 25. 

27. A nucleic acid which hybridizes, under high stringency conditions, with 
the sequence as claimed in claim 25. 

28. A cell comprising a nucleic acid as claimed in one of claims 24 to 27. 

29. A cell expressing a protein as claimed in one of claims 19 to 23.

30. A fragment of a cell as claimed in either of claims 28 and 29.

31. A lysate of a cell as claimed in either of claims 28 and 29.

32. A membrane of a cell as claimed in either of claims 28 and 29.

33. A method for determining the modification, by a compound, of the
interaction between a protein exhibiting at least 65% identity with the
sequence SEQ ID No. 4 or the sequence SEQ ID No. 16, and the leptin
receptor, comprising the steps consisting in:

- bringing said compound into contact with a protein exhibiting at least 65%
identity with the sequence SEQ ID No. 4 or the sequence SEQ ID No. 16, and the
leptin receptor, or cells, or fragments or lysates or membranes of cells, comprising
such proteins, and optionally a suitable enzyme substrate, and

- measuring the interaction between the protein exhibiting at least 65%
identity with the sequence SEQ ID No. 4 or the sequence SEQ ID No. 16,
and the leptin receptor.

34. A method for determining the modification, by a compound, of the
interaction between a protein exhibiting at least 65% identity with the
sequence SEQ ID No. 4 or the sequence SEQ ID No. 16, and the leptin
receptor, comprising the steps consisting in:

- bringing said compound into contact with an energy-donor fusion protein and an
energy-acceptor fusion protein, or cells, or fragments or lysates or membranes of
cells, comprising such a protein, and optionally a suitable enzyme substrate, and

- measuring the energy transfer.

35. The method as claimed in claim 34, wherein the energy-donor fusion
protein is a protein from fusion between the leptin receptor, or a
substantial part of the leptin receptor, and luciferase, or a substantial
part of luciferase, and the energy-acceptor fusion protein is a fusion
protein as claimed in claim 22.

36. The method as claimed in claim 34, wherein the energy-donor fusion
protein is a fusion protein as claimed in claim 20, and the
energy-acceptor fusion protein is a protein from fusion between the leptin
receptor, or a substantial part of the leptin receptor, and YFP, or a
substantial part of YFP.

37. The method as claimed in claim 34, wherein the energy transfer
measured in the presence of the test compound is compared to that
measured in the absence of the test compound.

38. The method as claimed in claim 34, wherein the energy transfer
measured in the presence of the test compound and the leptin (or a
ligand of the receptor) is compared to that measured in the presence of
the compound in the absence of leptin (or a ligand of the receptor).

39. A method for screening or detecting compounds intended for the 
prevention and/or treatment of leptin-related pathological conditions, 
comprising the steps consisting in: 

- bringing said compound into contact with a protein exhibiting at least 65% 
identity with the sequence SEQ ID No. 4 or the sequence SEQ ID No. 16, and the 
leptin receptor, or cells, or fragments or lysates or membranes of cells, comprising 
such proteins, and optionally a suitable enzyme substrate, and 

- measuring the interaction between a protein exhibiting at least 65% identity with 
the sequence SEQ ID No. 4 or the sequence SEQ ID No. 16, and the leptin 
receptor. 

40. The method as claimed in either of claims 34 and 35, wherein the 
fusion protein is a protein of sequence SEQ ID No. 6, SEQ ID No. 8, 
SEQ ID No. 12, SEQ ID No. 14, SEQ ID No. 18 or SEQ ID No. 20. 

41. The method as claimed in one of claims 33 to 40, wherein the cells are
treated with a permeabilizing agent. 



SEQUENCE LISTING 



<110> AVENTIS PHARMA

<120> OB RGRP ANTISENSE

<130> OB RGRP ANTISENSE

<140>
<141>

<160> 21

<170> PatentIn Ver. 2.1

<210> 1
<211> 648
<212> DNA
<213> Homo sapiens

<400> 1
cactttattc tgattacagt gcattgaatt tcttagaact catactatct gtatacatgt 60
gcacatgcgg cattttacta tgaaatttaa tatgctgggt tttttaatac ctttatatat 120
catgttcact ttaagaaaga cttcataagt aggagatgag ttttattctc agcaaataga 180
cctgtcaaat ttagattatg ttactcaaat tatgttactt gtttggctgt tcatgtagtc 240
acggtgctct cagaaaatat attaacgcag tcttgtaggc agctgccacc ttatgcagtg 300
catcgaaacc ttttgcttgg ggatgtgctt ggagaggcag ataacgctga agcaggcctc 360
tcatgaccca ggaaggccgg ggtggatccc tctttgtgtt gtagtccatg ctattaaaag 420
tgtggcccac agaccaagag cctcaacatt tcctagagcc ttattagaaa tgcagaatct 480
gaagccccac tctggaccca ggacattttg atgagatcca aaggagttgt atgcacatga 540
aagtttgaga agcatcatca tagagaagta aacatcacac ccaacttcct tatctttcca 600
gtggctaaac cacttaacct ctctgggcgt tacctgctca tttgttta 648



<210> 2 
<211> 20 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Artificial sequence description : antisense AS14 



<400> 2 

aatgccgcat gtgcacatgt 20 



<210> 3
<211> 396
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (1)..(396)

<400> 3
atg gcg ggc gtt aaa gct ctc gtg gca tta tcc ttc agt ggg gct att 48
Met Ala Gly Val Lys Ala Leu Val Ala Leu Ser Phe Ser Gly Ala Ile
  1               5                  10                  15

gga ctg act ttt ctt atg ctg gga tgt gcc tta gag gat tat ggc gtt 96
Gly Leu Thr Phe Leu Met Leu Gly Cys Ala Leu Glu Asp Tyr Gly Val
             20                  25                  30

tac tgg ccc tta ttc gtc ctg att ttc cac gcc atc tcc ccc atc ccc 144
Tyr Trp Pro Leu Phe Val Leu Ile Phe His Ala Ile Ser Pro Ile Pro
         35                  40                  45

cat ttc att gcc aaa aga gtc acc tat gac tca gat gca acc agt agt 192
His Phe Ile Ala Lys Arg Val Thr Tyr Asp Ser Asp Ala Thr Ser Ser
     50                  55                  60

gcc tgt cgg gaa ctg gca tat ttc ttc act act gga att gtt gtt tct 240
Ala Cys Arg Glu Leu Ala Tyr Phe Phe Thr Thr Gly Ile Val Val Ser
 65                  70                  75                  80

gcc ttt gga ttt cct gtt att ctt gct cgt gtg gct gtg atc aaa tgg 288
Ala Phe Gly Phe Pro Val Ile Leu Ala Arg Val Ala Val Ile Lys Trp
                 85                  90                  95

gga gcc tgc ggc ctt gtg ttg gca ggc aat gca gtc att ttc ctt aca 336
Gly Ala Cys Gly Leu Val Leu Ala Gly Asn Ala Val Ile Phe Leu Thr
            100                 105                 110

att caa ggg ttt ttc ctt ata ttt gga aga gga gat gat ttt agc tgg 384
Ile Gln Gly Phe Phe Leu Ile Phe Gly Arg Gly Asp Asp Phe Ser Trp
        115                 120                 125

gag cag tgg tag 396
Glu Gln Trp
    130



<210> 4
<211> 131
<212> PRT
<213> Homo sapiens

<400> 4
Met Ala Gly Val Lys Ala Leu Val Ala Leu Ser Phe Ser Gly Ala Ile
  1               5                  10                  15

Gly Leu Thr Phe Leu Met Leu Gly Cys Ala Leu Glu Asp Tyr Gly Val
             20                  25                  30

Tyr Trp Pro Leu Phe Val Leu Ile Phe His Ala Ile Ser Pro Ile Pro
         35                  40                  45

His Phe Ile Ala Lys Arg Val Thr Tyr Asp Ser Asp Ala Thr Ser Ser
     50                  55                  60

Ala Cys Arg Glu Leu Ala Tyr Phe Phe Thr Thr Gly Ile Val Val Ser
 65                  70                  75                  80

Ala Phe Gly Phe Pro Val Ile Leu Ala Arg Val Ala Val Ile Lys Trp
                 85                  90                  95

Gly Ala Cys Gly Leu Val Leu Ala Gly Asn Ala Val Ile Phe Leu Thr
            100                 105                 110

Ile Gln Gly Phe Phe Leu Ile Phe Gly Arg Gly Asp Asp Phe Ser Trp
        115                 120                 125

Glu Gln Trp
    130



<210> 5 

<211> 1359 

<212> DNA 

<213> Artificial sequence

<220>

<221> CDS

<222> (1)..(1359)

<220>
<223> Artificial sequence description: OB RGRP LUC



<400> 5 

atg gcg ggc gtt aaa gct ctc gtg gca tta tcc ttc agt ggg gct att 48
Met Ala Gly Val Lys Ala Leu Val Ala Leu Ser Phe Ser Gly Ala Ile
1 5 10 15

gga ctg act ttt ctt atg ctg gga tgt gcc tta gag gat tat ggc gtt 96
Gly Leu Thr Phe Leu Met Leu Gly Cys Ala Leu Glu Asp Tyr Gly Val
20 25 30

tac tgg ccc tta ttc gtc ctg att ttc cac gcc atc tcc ccc atc ccc 144
Tyr Trp Pro Leu Phe Val Leu Ile Phe His Ala Ile Ser Pro Ile Pro
35 40 45

cat ttc att gcc aaa aga gtc acc tat gac tca gat gca acc agt agt 192
His Phe Ile Ala Lys Arg Val Thr Tyr Asp Ser Asp Ala Thr Ser Ser
50 55 60

gcc tgt cgg gaa ctg gca tat ttc ttc act act gga att gtt gtt tct 240

Ala Cys Arg Glu Leu Ala Tyr Phe Phe Thr Thr Gly Ile Val Val Ser

65 70 75 80

gcc ttt gga ttt cct gtt att ctt gct cgt gtg gct gtg atc aaa tgg 288
Ala Phe Gly Phe Pro Val Ile Leu Ala Arg Val Ala Val Ile Lys Trp
85 90 95

gga gcc tgc ggc ctt gtg ttg gca ggc aat gca gtc att ttc ctt aca 336
Gly Ala Cys Gly Leu Val Leu Ala Gly Asn Ala Val Ile Phe Leu Thr
100 105 110

att caa ggg ttt ttc ctt ata ttt gga aga gga gat gat ttt agc tgg 384
Ile Gln Gly Phe Phe Leu Ile Phe Gly Arg Gly Asp Asp Phe Ser Trp
115 120 125

gag cag tgg att ccg ggg gat cca ccg gct aga gcc acc atg acc agc 432
Glu Gln Trp Ile Pro Gly Asp Pro Pro Ala Arg Ala Thr Met Thr Ser
130 135 140

aag gtg tac gac ccc gag cag agg aag agg atg atc acc ggc ccc cag 480
Lys Val Tyr Asp Pro Glu Gln Arg Lys Arg Met Ile Thr Gly Pro Gln
145 150 155 160

tgg tgg gcc agg tgc aag cag atg aac gtg ctg gac agc ttc atc aac 528
Trp Trp Ala Arg Cys Lys Gln Met Asn Val Leu Asp Ser Phe Ile Asn
165 170 175

tac tac gac agc gag aag cac gcc gag aac gcc gtg atc ttc ctg cac 576
Tyr Tyr Asp Ser Glu Lys His Ala Glu Asn Ala Val Ile Phe Leu His
180 185 190

ggc aac gcc gct agc agc tac ctg tgg agg cac gtg gtg ccc cac atc 624
Gly Asn Ala Ala Ser Ser Tyr Leu Trp Arg His Val Val Pro His Ile
195 200 205

gag ccc gtg gcc agg tgc atc atc ccc gat ctg atc ggc atg ggc aag 672
Glu Pro Val Ala Arg Cys Ile Ile Pro Asp Leu Ile Gly Met Gly Lys
210 215 220

agc ggc aag agc ggc aac ggc agc tac agg ctg ctg gac cac tac aag 720
Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg Leu Leu Asp His Tyr Lys
225 230 235 240

tac ctg acc gcc tgg ttc gag ctc ctg aac ctg ccc aag aag atc atc 768
Tyr Leu Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys Lys Ile Ile
245 250 255

[3 codons illegible] cac gac tgg [6 codons illegible] cac tac agc tac 816
Phe Val Gly His Asp Trp Gly Ala Cys Leu Ala Phe His Tyr Ser Tyr
260 265 270

gag cac cag gac aag atc aag gcc atc gtg cac gcc gag agc gtg gtg 864
Glu His Gln Asp Lys Ile Lys Ala Ile Val His Ala Glu Ser Val Val
275 280 285

gac gtg atc gag agc tgg gac gag tgg cca gac atc gag gag gac atc 912
Asp Val Ile Glu Ser Trp Asp Glu Trp Pro Asp Ile Glu Glu Asp Ile
290 295 300

gcc ctg atc aag agc gag gag ggc gag aag atg gtg ctg gag aac aac 960
Ala Leu Ile Lys Ser Glu Glu Gly Glu Lys Met Val Leu Glu Asn Asn
305 310 315 320

ttc ttc gtg gag acc atg ctg ccc agc aag atc atg aga aag ctg gag 1008
Phe Phe Val Glu Thr Met Leu Pro Ser Lys Ile Met Arg Lys Leu Glu
325 330 335

ccc gag gag ttc gcc gcc tac ctg gag ccc ttc aag gag aag ggc gag 1056
Pro Glu Glu Phe Ala Ala Tyr Leu Glu Pro Phe Lys Glu Lys Gly Glu
340 345 350

gtg aga aga ccc acc ctg agc tgg ccc aga gag atc ccc ctg gtg aag 1104
Val Arg Arg Pro Thr Leu Ser Trp Pro Arg Glu Ile Pro Leu Val Lys
355 360 365

ggc ggc aag ccc gac gtg gtg cag atc gtg aga aac tac aac gcc tac 1152
Gly Gly Lys Pro Asp Val Val Gln Ile Val Arg Asn Tyr Asn Ala Tyr
370 375 380

ctg aga gcc agc gac gac ctg ccc aag atg ttc atc gag agc gac ccc 1200
Leu Arg Ala Ser Asp Asp Leu Pro Lys Met Phe Ile Glu Ser Asp Pro
385 390 395 400



ggc ttc ttc agc aac gcc atc gtg gag ggc gcc aag aag ttc ccc aac 1248
Gly Phe Phe Ser Asn Ala Ile Val Glu Gly Ala Lys Lys Phe Pro Asn
405 410 415

acc gag ttc gtg aag gtg aag ggc ctg cac ttc agc cag gag gac gcc 1296
Thr Glu Phe Val Lys Val Lys Gly Leu His Phe Ser Gln Glu Asp Ala
420 425 430

ccc gac gag atg ggc aag tac atc aag agc ttc gtg gag aga gtg ctg 1344
Pro Asp Glu Met Gly Lys Tyr Ile Lys Ser Phe Val Glu Arg Val Leu
435 440 445

aag aac gag cag taa 1359
Lys Asn Glu Gln
450



<210> 6 
<211> 452 
<212> PRT 

<213> Artificial sequence 

<223> Artificial sequence description: OB RGRP LUC 
<400> 6 

Met Ala Gly Val Lys Ala Leu Val Ala Leu Ser Phe Ser Gly Ala Ile
1 5 10 15

Gly Leu Thr Phe Leu Met Leu Gly Cys Ala Leu Glu Asp Tyr Gly Val
20 25 30

Tyr Trp Pro Leu Phe Val Leu Ile Phe His Ala Ile Ser Pro Ile Pro
35 40 45

His Phe Ile Ala Lys Arg Val Thr Tyr Asp Ser Asp Ala Thr Ser Ser
50 55 60

Ala Cys Arg Glu Leu Ala Tyr Phe Phe Thr Thr Gly Ile Val Val Ser
65 70 75 80

Ala Phe Gly Phe Pro Val Ile Leu Ala Arg Val Ala Val Ile Lys Trp
85 90 95

Gly Ala Cys Gly Leu Val Leu Ala Gly Asn Ala Val Ile Phe Leu Thr
100 105 110

Ile Gln Gly Phe Phe Leu Ile Phe Gly Arg Gly Asp Asp Phe Ser Trp
115 120 125

Glu Gln Trp Ile Pro Gly Asp Pro Pro Ala Arg Ala Thr Met Thr Ser
130 135 140

Lys Val Tyr Asp Pro Glu Gln Arg Lys Arg Met Ile Thr Gly Pro Gln
145 150 155 160

Trp Trp Ala Arg Cys Lys Gln Met Asn Val Leu Asp Ser Phe Ile Asn
165 170 175

Tyr Tyr Asp Ser Glu Lys His Ala Glu Asn Ala Val Ile Phe Leu His
180 185 190

Gly Asn Ala Ala Ser Ser Tyr Leu Trp Arg His Val Val Pro His Ile
195 200 205

Glu Pro Val Ala Arg Cys Ile Ile Pro Asp Leu Ile Gly Met Gly Lys
210 215 220

Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg Leu Leu Asp His Tyr Lys
225 230 235 240

Tyr Leu Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys Lys Ile Ile
245 250 255

Phe Val Gly His Asp Trp Gly Ala Cys Leu Ala Phe His Tyr Ser Tyr
260 265 270

Glu His Gln Asp Lys Ile Lys Ala Ile Val His Ala Glu Ser Val Val
275 280 285

Asp Val Ile Glu Ser Trp Asp Glu Trp Pro Asp Ile Glu Glu Asp Ile
290 295 300

Ala Leu Ile Lys Ser Glu Glu Gly Glu Lys Met Val Leu Glu Asn Asn
305 310 315 320

Phe Phe Val Glu Thr Met Leu Pro Ser Lys Ile Met Arg Lys Leu Glu
325 330 335

Pro Glu Glu Phe Ala Ala Tyr Leu Glu Pro Phe Lys Glu Lys Gly Glu
340 345 350

Val Arg Arg Pro Thr Leu Ser Trp Pro Arg Glu Ile Pro Leu Val Lys
355 360 365

Gly Gly Lys Pro Asp Val Val Gln Ile Val Arg Asn Tyr Asn Ala Tyr
370 375 380

Leu Arg Ala Ser Asp Asp Leu Pro Lys Met Phe Ile Glu Ser Asp Pro
385 390 395 400

Gly Phe Phe Ser Asn Ala Ile Val Glu Gly Ala Lys Lys Phe Pro Asn
405 410 415

Thr Glu Phe Val Lys Val Lys Gly Leu His Phe Ser Gln Glu Asp Ala
420 425 430

Pro Asp Glu Met Gly Lys Tyr Ile Lys Ser Phe Val Glu Arg Val Leu
435 440 445

Lys Asn Glu Gln
450



<210> 7 

<211> 1128 

<212> DNA 

<213> Artificial sequence 

<220> 

<221> CDS 

<222> (1)..(1128)

<220>

<223> Artificial sequence description: OB RGRP YFP



<400> 7 

atg gcg ggc gtt aaa gct ctc gtg gca tta tcc ttc agt ggg gct atc 48
Met Ala Gly Val Lys Ala Leu Val Ala Leu Ser Phe Ser Gly Ala Ile
1 5 10 15

gga ctg act ttt ctt atg ctg gga tgt gcc tta gag gat tat ggc gtt 96
Gly Leu Thr Phe Leu Met Leu Gly Cys Ala Leu Glu Asp Tyr Gly Val
20 25 30

tac tgg ccc tta ttc gtc ctg att ttc cac gcc atc tcc ccc atc ccc 144
Tyr Trp Pro Leu Phe Val Leu Ile Phe His Ala Ile Ser Pro Ile Pro
35 40 45

cat ttc att gcc aaa aga gtc acc tat gac tca gat gca acc agt agt 192
His Phe Ile Ala Lys Arg Val Thr Tyr Asp Ser Asp Ala Thr Ser Ser
50 55 60



gcc tgt cgg gaa ctg gca tat ttc ttc act act gga att gtt gtt tct 240
Ala Cys Arg Glu Leu Ala Tyr Phe Phe Thr Thr Gly Ile Val Val Ser
65 70 75 80

gcc ttt gga ttt cct gtt att ctt gct cgt gtg gct gtg atc aaa tgg 288
Ala Phe Gly Phe Pro Val Ile Leu Ala Arg Val Ala Val Ile Lys Trp
85 90 95

gga gcc tgc ggc ctt gtg ttg gca ggc aat gca gtc att ttc ctt aca 336
Gly Ala Cys Gly Leu Val Leu Ala Gly Asn Ala Val Ile Phe Leu Thr
100 105 110

att caa ggg ttt ttc ctt ata ttt gga aga gga gat gat ttt agc tgg 384
Ile Gln Gly Phe Phe Leu Ile Phe Gly Arg Gly Asp Asp Phe Ser Trp
115 120 125

gag cag tgg att ccg gtc gcc acc atg gtg agc aag ggc gag gag ctg 432
Glu Gln Trp Ile Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu
130 135 140

ttc acc ggg gtg gtg ccc atc ctg gtc gag ctg gac ggc gac gta aac 480
Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn
145 150 155 160

ggc cac aag ttc agc gtg tcc ggc gag ggc gag ggc gat gcc acc tac 528
Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr
165 170 175

ggc aag ctg acc ctg aag ttc atc tgc acc acc ggc aag ctg ccc gtg 576
Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val
180 185 190

ccc tgg ccc acc ctc gtg acc acc ttc ggc tac ggc gtg cag tgc ttc 624
Pro Trp Pro Thr Leu Val Thr Thr Phe Gly Tyr Gly Val Gln Cys Phe
195 200 205

gcc cgc tac ccc gac cac atg cgc cag cac gac ttc ttc aag tcc gcc 672
Ala Arg Tyr Pro Asp His Met Arg Gln His Asp Phe Phe Lys Ser Ala
210 215 220

atg ccc gaa ggc tac gtc cag gag cgc acc atc ttc ttc aag gac gac 720
Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp
225 230 235 240

ggc aac tac aag acc cgc gcc gag gtg aag ttc gag ggc gac acc ctg 768
Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu
245 250 255

gtg aac cgc atc gag ctg aag ggc atc gac ttc aag gag gac ggc aac 816
Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn
260 265 270

atc ctg ggg cac aag ctg gag tac aac tac aac agc cac aac gtc tat 864
Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr
275 280 285

atc atg gcc gac aag cag aag aac ggc atc aag gtg aac ttc aag atc 912
Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile
290 295 300

cgc cac aac atc gag gac ggc agc gtg cag ctc gcc gac cac tac cag 960
Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln
305 310 315 320

cag aac acc ccc atc ggc gac ggc ccc gtg ctg ctg ccc gac aac cac 1008
Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His
325 330 335

tac ctg agc tac cag tcc gcc ctg agc aaa gac ccc aac gag aag cgc 1056
Tyr Leu Ser Tyr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg
340 345 350

gat cac atg gtc ctg ctg gag ttc gtg acc gcc gcc ggg atc act ctc 1104
Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu
355 360 365

ggc atg gac gag ctg tac aag taa 1128
Gly Met Asp Glu Leu Tyr Lys
370 375



<210> 8 

<211> 375 

<212> PRT 

<213> Artificial sequence 

<223> Artificial sequence description: OB RGRP YFP



<400> 8 

Met Ala Gly Val Lys Ala Leu Val Ala Leu Ser Phe Ser Gly Ala Ile
1 5 10 15

Gly Leu Thr Phe Leu Met Leu Gly Cys Ala Leu Glu Asp Tyr Gly Val
20 25 30

Tyr Trp Pro Leu Phe Val Leu Ile Phe His Ala Ile Ser Pro Ile Pro
35 40 45

His Phe Ile Ala Lys Arg Val Thr Tyr Asp Ser Asp Ala Thr Ser Ser
50 55 60

Ala Cys Arg Glu Leu Ala Tyr Phe Phe Thr Thr Gly Ile Val Val Ser
65 70 75 80

Ala Phe Gly Phe Pro Val Ile Leu Ala Arg Val Ala Val Ile Lys Trp
85 90 95

Gly Ala Cys Gly Leu Val Leu Ala Gly Asn Ala Val Ile Phe Leu Thr
100 105 110

Ile Gln Gly Phe Phe Leu Ile Phe Gly Arg Gly Asp Asp Phe Ser Trp
115 120 125

Glu Gln Trp Ile Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu
130 135 140

Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn
145 150 155 160

Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr
165 170 175

Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val
180 185 190

Pro Trp Pro Thr Leu Val Thr Thr Phe Gly Tyr Gly Val Gln Cys Phe
195 200 205

Ala Arg Tyr Pro Asp His Met Arg Gln His Asp Phe Phe Lys Ser Ala
210 215 220

Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp
225 230 235 240

Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu
245 250 255

Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn
260 265 270

Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr



gag aca gct gtt gaa cct aag ttt aat tca agt ggt act cac ttt tct 240
Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser
65 70 75 80

aac tta tcc aaa aca act ttc cac tgt tgc ttt cgg agt gag caa gat 288
Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gln Asp
85 90 95

aga aac tgc tcc tta tgt gca gac aac att gaa gga aag aca ttt gtt 336
Arg Asn Cys Ser Leu Cys Ala Asp Asn Ile Glu Gly Lys Thr Phe Val
100 105 110

tca aca gta aat tct tta gtt ttt caa caa ata gat gca aac tgg aac 384
Ser Thr Val Asn Ser Leu Val Phe Gln Gln Ile Asp Ala Asn Trp Asn
115 120 125

ata cag tgc tgg cta aaa gga gac tta aaa tta ttc atc tgt tat gtg 432
Ile Gln Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe Ile Cys Tyr Val
130 135 140

gag tca tta ttt aag aat cta ttc agg aat tat aac tat aag gtc cat 480
Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His
145 150 155 160

ctt tta tat gtt ctg cct gaa gtg tta gaa gat tca cct ctg gtt ccc 528
Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro
165 170 175

caa aaa ggc agt ttt cag atg gtt cac tgc aat tgc agt gtt cat gaa 576
Gln Lys Gly Ser Phe Gln Met Val His Cys Asn Cys Ser Val His Glu
180 185 190

tgt tgt gaa tgt ctt gtg cct gtg cca aca gcc aaa ctc aac gac act 624
Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr
195 200 205

ctc ctt atg tgt ttg aaa atc aca tct ggt gga gta att ttc cag tca 672
Leu Leu Met Cys Leu Lys Ile Thr Ser Gly Gly Val Ile Phe Gln Ser
210 215 220

cct cta atg tca gtt cag ccc ata aat atg gtg aag cct gat cca cca 720
Pro Leu Met Ser Val Gln Pro Ile Asn Met Val Lys Pro Asp Pro Pro
225 230 235 240

tta ggt ttg cat atg gaa atc aca gat gat ggt aat tta aag att tct 768
Leu Gly Leu His Met Glu Ile Thr Asp Asp Gly Asn Leu Lys Ile Ser
245 250 255

tgg tcc agc cca cca ttg gta cca ttt cca ctt caa tat caa gtg aaa 816
Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gln Tyr Gln Val Lys
260 265 270

tat tca gag aat tct aca aca gtt atc aga gaa gct gac aag att gtc 864
Tyr Ser Glu Asn Ser Thr Thr Val Ile Arg Glu Ala Asp Lys Ile Val
275 280 285

tca gct aca tcc ctg cta gta gac agt ata ctt cct ggg tct tcg tat 912
Ser Ala Thr Ser Leu Leu Val Asp Ser Ile Leu Pro Gly Ser Ser Tyr
290 295 300

gag gtt cag gtg agg ggc aag aga ctg gat ggc cca gga atc tgg agt 960
Glu Val Gln Val Arg Gly Lys Arg Leu Asp Gly Pro Gly Ile Trp Ser
305 310 315 320

gac tgg agt act cct cgt gtc ttt acc aca caa gat gtc ata tac ttt 1008
Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gln Asp Val Ile Tyr Phe
325 330 335

cca cct aaa att ctg aca agt gtt ggg tct aat gtt tct ttt cac tgc 1056
Pro Pro Lys Ile Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys
340 345 350

atc tat aag aag gaa aac aag att gtt ccc tca aaa gag att gtt tgg 1104
Ile Tyr Lys Lys Glu Asn Lys Ile Val Pro Ser Lys Glu Ile Val Trp
355 360 365

tgg atg aat tta gct gag aaa att cct caa agc cag tat gat gtt gtg 1152
Trp Met Asn Leu Ala Glu Lys Ile Pro Gln Ser Gln Tyr Asp Val Val
370 375 380

agt gat cat gtt agc aaa gtt act ttt ttc aat ctg aat gaa acc aaa 1200
Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys
385 390 395 400

cct cga gga aag ttt acc tat gat gca gtg tac tgc tgc aat gaa cat 1248
Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His
405 410 415

gaa tgc cat cat cgc tat gct gaa tta tat gtg att gat gtc aat atc 1296
Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val Ile Asp Val Asn Ile
420 425 430

aat atc tca tgt gaa act gat ggg tac tta act aaa atg act tgc aga 1344
Asn Ile Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg
435 440 445

tgg tca acc agt aca atc cag tca ctt gcg gaa agc act ttg caa ttg 1392
Trp Ser Thr Ser Thr Ile Gln Ser Leu Ala Glu Ser Thr Leu Gln Leu
450 455 460

agg tat cat agg agc agc ctt tac tgt tct gat att cca tct att cat 1440
Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp Ile Pro Ser Ile His
465 470 475 480

ccc ata tct gag ccc aaa gat tgc tat ttg cag agt gat ggt ttt tat 1488
Pro Ile Ser Glu Pro Lys Asp Cys Tyr Leu Gln Ser Asp Gly Phe Tyr
485 490 495

gaa tgc att ttc cag cca atc ttc cta tta tct ggc tac aca atg tgg 1536
Glu Cys Ile Phe Gln Pro Ile Phe Leu Leu Ser Gly Tyr Thr Met Trp
500 505 510

att agg atc aat cac tct cta ggt tca ctt gac tct cca cca aca tgt 1584
Ile Arg Ile Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys
515 520 525

gtc ctt cct gat tct gtg gtg aag cca ctg cct cca tcc agt gtg aaa 1632
Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys
530 535 540

gca gaa att act ata aac att gga tta ttg aaa ata tct tgg gaa aag 1680
Ala Glu Ile Thr Ile Asn Ile Gly Leu Leu Lys Ile Ser Trp Glu Lys
545 550 555 560

cca gtc ttt cca gag aat aac ctt caa ttc cag att cgc tat ggt tta 1728
Pro Val Phe Pro Glu Asn Asn Leu Gln Phe Gln Ile Arg Tyr Gly Leu
565 570 575

agt gga aaa gaa gta caa tgg aag atg tat gag gtt tat gat gca aaa 1776
Ser Gly Lys Glu Val Gln Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys
580 585 590

tca aaa tct gtc agt ctc cca gtt cca gac ttg tgt gca gtc tat gct 1824
Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala
595 600 605

gtt cag gtg cgc tgt aag agg cta gat gga ctg gga tat tgg agt aat 1872
Val Gln Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn
610 615 620

tgg agc aat cca gcc tac aca gtt gtc atg gat ata aaa gtt cct atg 1920
Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp Ile Lys Val Pro Met
625 630 635 640



aga gga cct gaa ttt tgg aga ata att aat gga gat act atg aaa aag 1968
Arg Gly Pro Glu Phe Trp Arg Ile Ile Asn Gly Asp Thr Met Lys Lys
645 650 655

gag aaa aat gtc act tta ctt tgg aag ccc ctg atg aaa aat gac tca 2016
Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser
660 665 670

ttg tgc agt gtt cag aga tat gtg ata aac cat cat act tcc tgc aat 2064
Leu Cys Ser Val Gln Arg Tyr Val Ile Asn His His Thr Ser Cys Asn
675 680 685

gga aca tgg tca gaa gat gtg gga aat cac acg aaa ttc act ttc ctg 2112
Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu
690 695 700

tgg aca gag caa gca cat act gtt acg gtt ctg gcc atc aat tca att 2160
Trp Thr Glu Gln Ala His Thr Val Thr Val Leu Ala Ile Asn Ser Ile
705 710 715 720

ggt gct tct gtt gca aat ttt aat tta acc ttt tca tgg cct atg agc 2208
Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser
725 730 735

aaa gta aat atc gtg cag tca ctc agt gct tat cct tta aac agc agt 2256
Lys Val Asn Ile Val Gln Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser
740 745 750

tgt gtg att gtt tcc tgg ata cta tca ccc agt gat tac aag cta atg 2304
Cys Val Ile Val Ser Trp Ile Leu Ser Pro Ser Asp Tyr Lys Leu Met
755 760 765

tat ttt att att gag tgg aaa aat ctt aat gaa gat ggt gaa ata aaa 2352
Tyr Phe Ile Ile Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu Ile Lys
770 775 780

tgg ctt aga atc tct tca tct gtt aag aag tat tat atc cat gat cat 2400
Trp Leu Arg Ile Ser Ser Ser Val Lys Lys Tyr Tyr Ile His Asp His
785 790 795 800

ttt atc ccc att gag aag tac cag ttc agt ctt tac cca ata ttt atg 2448
Phe Ile Pro Ile Glu Lys Tyr Gln Phe Ser Leu Tyr Pro Ile Phe Met
805 810 815

gaa gga gtg gga aaa cca aag ata att aat agt ttc act caa gat gat 2496
Glu Gly Val Gly Lys Pro Lys Ile Ile Asn Ser Phe Thr Gln Asp Asp
820 825 830



att gaa aaa cac cag agt gat gca ggt tta tat gta att gtg cca gta 2544
Ile Glu Lys His Gln Ser Asp Ala Gly Leu Tyr Val Ile Val Pro Val
835 840 845

att att tcc tct tcc atc tta ttg ctt gga aca tta tta ata tca cac 2592
Ile Ile Ser Ser Ser Ile Leu Leu Leu Gly Thr Leu Leu Ile Ser His
850 855 860

caa aga atg aaa aag cta ttt tgg gaa gat gtt ccg aac ccc aag aat 2640
Gln Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn
865 870 875 880

tgt tcc tgg gca caa gga ctt aat ttt cag aag aga acg gac att ctt 2688
Cys Ser Trp Ala Gln Gly Leu Asn Phe Gln Lys Arg Thr Asp Ile Leu
885 890 895

tga 2691



<210> 10 

<211> 896 

<212> PRT 

<213> Homo sapiens

<400> 10 

Met Ile Cys Gln Lys Phe Cys Val Val Leu Leu His Trp Glu Phe Ile
1 5 10 15

Tyr Val Ile Thr Ala Phe Asn Leu Ser Tyr Pro Ile Thr Pro Trp Arg
20 25 30

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu
35 40 45

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr
50 55 60

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser
65 70 75 80

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gln Asp
85 90 95

Arg Asn Cys Ser Leu Cys Ala Asp Asn Ile Glu Gly Lys Thr Phe Val
100 105 110

Ser Thr Val Asn Ser Leu Val Phe Gln Gln Ile Asp Ala Asn Trp Asn
115 120 125

Ile Gln Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe Ile Cys Tyr Val
130 135 140

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His
145 150 155 160

Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro
165 170 175

Gln Lys Gly Ser Phe Gln Met Val His Cys Asn Cys Ser Val His Glu
180 185 190

Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr
195 200 205

Leu Leu Met Cys Leu Lys Ile Thr Ser Gly Gly Val Ile Phe Gln Ser
210 215 220

Pro Leu Met Ser Val Gln Pro Ile Asn Met Val Lys Pro Asp Pro Pro
225 230 235 240

Leu Gly Leu His Met Glu Ile Thr Asp Asp Gly Asn Leu Lys Ile Ser
245 250 255

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gln Tyr Gln Val Lys
260 265 270

Tyr Ser Glu Asn Ser Thr Thr Val Ile Arg Glu Ala Asp Lys Ile Val
275 280 285

Ser Ala Thr Ser Leu Leu Val Asp Ser Ile Leu Pro Gly Ser Ser Tyr
290 295 300

Glu Val Gln Val Arg Gly Lys Arg Leu Asp Gly Pro Gly Ile Trp Ser
305 310 315 320

Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gln Asp Val Ile Tyr Phe
325 330 335

Pro Pro Lys Ile Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys
340 345 350

Ile Tyr Lys Lys Glu Asn Lys Ile Val Pro Ser Lys Glu Ile Val Trp
355 360 365

Trp Met Asn Leu Ala Glu Lys Ile Pro Gln Ser Gln Tyr Asp Val Val
370 375 380

Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys
385 390 395 400

Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His
405 410 415

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val Ile Asp Val Asn Ile
420 425 430

Asn Ile Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg
435 440 445

Trp Ser Thr Ser Thr Ile Gln Ser Leu Ala Glu Ser Thr Leu Gln Leu
450 455 460

Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp Ile Pro Ser Ile His
465 470 475 480

Pro Ile Ser Glu Pro Lys Asp Cys Tyr Leu Gln Ser Asp Gly Phe Tyr
485 490 495

Glu Cys Ile Phe Gln Pro Ile Phe Leu Leu Ser Gly Tyr Thr Met Trp
500 505 510

Ile Arg Ile Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys
515 520 525

Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys
530 535 540

Ala Glu Ile Thr Ile Asn Ile Gly Leu Leu Lys Ile Ser Trp Glu Lys
545 550 555 560

Pro Val Phe Pro Glu Asn Asn Leu Gln Phe Gln Ile Arg Tyr Gly Leu
565 570 575

Ser Gly Lys Glu Val Gln Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys
580 585 590

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala
595 600 605

Val Gln Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn
610 615 620

Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp Ile Lys Val Pro Met
625 630 635 640

Arg Gly Pro Glu Phe Trp Arg Ile Ile Asn Gly Asp Thr Met Lys Lys
645 650 655

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser
660 665 670

Leu Cys Ser Val Gln Arg Tyr Val Ile Asn His His Thr Ser Cys Asn
675 680 685

Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu
690 695 700

Trp Thr Glu Gln Ala His Thr Val Thr Val Leu Ala Ile Asn Ser Ile
705 710 715 720

Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser
725 730 735

Lys Val Asn Ile Val Gln Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser
740 745 750

Cys Val Ile Val Ser Trp Ile Leu Ser Pro Ser Asp Tyr Lys Leu Met
755 760 765

Tyr Phe Ile Ile Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu Ile Lys
770 775 780

Trp Leu Arg Ile Ser Ser Ser Val Lys Lys Tyr Tyr Ile His Asp His
785 790 795 800

Phe Ile Pro Ile Glu Lys Tyr Gln Phe Ser Leu Tyr Pro Ile Phe Met
805 810 815

Glu Gly Val Gly Lys Pro Lys Ile Ile Asn Ser Phe Thr Gln Asp Asp
820 825 830

Ile Glu Lys His Gln Ser Asp Ala Gly Leu Tyr Val Ile Val Pro Val
835 840 845

Ile Ile Ser Ser Ser Ile Leu Leu Leu Gly Thr Leu Leu Ile Ser His
850 855 860

Gln Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn
865 870 875 880

Cys Ser Trp Ala Gln Gly Leu Asn Phe Gln Lys Arg Thr Asp Ile Leu
885 890 895



acg aca ttt gtt tca aca gta aat tct tta gtt ttt caa caa ata gat
Thr Thr Phe Val Ser Thr Val Asn Ser Leu Val Phe Gln Gln Ile Asp
130 135 140

gca aac tgg aac ata cag tgc tgg cta aaa gga gac tta aaa tta ttc
Ala Asn Trp Asn Ile Gln Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe
145 150 155 160

atc tgt tat gtg gag tca tta ttt aag aat cta ttc agg aat tat aac
Ile Cys Tyr Val Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn
165 170 175

tat aag gtc cat ctt tta tat gtt ctg cct gaa gtg tta gaa gat tca
Tyr Lys Val His Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser
180 185 190

cct ctg gtt ccc caa aaa ggc agt ttt cag atg gtt cac tgc aat tgc
Pro Leu Val Pro Gln Lys Gly Ser Phe Gln Met Val His Cys Asn Cys
195 200 205

agt gtt cat gaa tgt tgt gaa tgt ctt gtg cct gtg cca aca gcc aaa
Ser Val His Glu Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys
210 215 220

ctc aac gac act ctc ctt atg tgt ttg aaa atc aca tct ggt gga gta
Leu Asn Asp Thr Leu Leu Met Cys Leu Lys Ile Thr Ser Gly Gly Val
225 230 235 240

att ttc cgg tca cct cta atg tca gtt cag ccc ata aat atg gtg aag
Ile Phe Arg Ser Pro Leu Met Ser Val Gln Pro Ile Asn Met Val Lys
245 250 255

cct gat cca cca tta ggt ttg cat atg gaa atc aca gat gat ggt aat
Pro Asp Pro Pro Leu Gly Leu His Met Glu Ile Thr Asp Asp Gly Asn
260 265 270

tta aag att tct tgg tcc agc cca cca ttg gta cca ttt cca ctt caa
Leu Lys Ile Ser Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gln
275 280 285

tat caa gtg aaa tat tca gag aat tct aca aca gtt atc aga gaa gct
Tyr Gln Val Lys Tyr Ser Glu Asn Ser Thr Thr Val Ile Arg Glu Ala
290 295 300

gac aag att gtc tca gct aca tcc ctg cta gta gac agt ata ctt cct
Asp Lys Ile Val Ser Ala Thr Ser Leu Leu Val Asp Ser Ile Leu Pro
305 310 315 320



ggg tct tcg tat gag gtt cag gtg agg ggc aag aga ctg gat ggc cca 1008
Gly Ser Ser Tyr Glu Val Gln Val Arg Gly Lys Arg Leu Asp Gly Pro
325 330 335

gga atc tgg agt gac tgg agt act cct cgt gtc ttt acc aca caa gat 1056
Gly Ile Trp Ser Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gln Asp
340 345 350

gtc ata tac ttt cca cct aaa att ctg aca agt gtt ggg tct aat gtt 1104
Val Ile Tyr Phe Pro Pro Lys Ile Leu Thr Ser Val Gly Ser Asn Val
355 360 365

tct ttt cac tgc atc tat aag aag gaa aac aag att gtt ccc tca aaa 1152
Ser Phe His Cys Ile Tyr Lys Lys Glu Asn Lys Ile Val Pro Ser Lys
370 375 380

gag att gtt tgg tgg atg aat tta gct gag aaa att cct caa agc cag 1200
Glu Ile Val Trp Trp Met Asn Leu Ala Glu Lys Ile Pro Gln Ser Gln
385 390 395 400

tat gat gtt gtg agt gat cat gtt agc aaa gtt act ttt ttc aat ctg 1248
Tyr Asp Val Val Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu
405 410 415

aat gaa acc aaa cct cga gga aag ttt acc tat gat gca gtg tac tgc 1296
Asn Glu Thr Lys Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys
420 425 430

tgc aat gaa cat gaa tgc cat cat cgc tat gct gaa tta tat gtg att 1344
Cys Asn Glu His Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val Ile
435 440 445

gat gtc aat atc aat atc tca tgt gaa act gat ggg tac tta act aaa 1392
Asp Val Asn Ile Asn Ile Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys
450 455 460

atg act tgc aga tgg tca acc agt aca atc cag tca ctt gcg gaa agc 1440
Met Thr Cys Arg Trp Ser Thr Ser Thr Ile Gln Ser Leu Ala Glu Ser
465 470 475 480

act ttg caa ttg agg tat cat agg agc agc ctt tac tgt tct gat att 1488
Thr Leu Gln Leu Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp Ile
485 490 495

cca tct att cat ccc ata tct gag ccc aaa gat tgc tat ttg cag agt 1536
Pro Ser Ile His Pro Ile Ser Glu Pro Lys Asp Cys Tyr Leu Gln Ser
500 505 510



gat ggt ttt tat gaa tgc att ttc cag cca atc ttc cta tta tct ggc 1584
Asp Gly Phe Tyr Glu Cys Ile Phe Gln Pro Ile Phe Leu Leu Ser Gly
515 520 525

tac aca atg tgg att agg atc aat cac tct cta ggt tca ctt gac tct 1632
Tyr Thr Met Trp Ile Arg Ile Asn His Ser Leu Gly Ser Leu Asp Ser
530 535 540

cca cca aca tgt gtc ctt cct gat tct gtg gtg aag cca ctg cct cca 1680
Pro Pro Thr Cys Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro
545 550 555 560

tcc agt gtg aaa gca gaa att act ata aac att gga tta ttg aaa ata 1728
Ser Ser Val Lys Ala Glu Ile Thr Ile Asn Ile Gly Leu Leu Lys Ile
565 570 575

tct tgg gaa aag cca gtc ttt cca gag aat aac ctt caa ttc cag att 1776
Ser Trp Glu Lys Pro Val Phe Pro Glu Asn Asn Leu Gln Phe Gln Ile
580 585 590

cgc tat ggt tta agt gga aaa gaa gta caa tgg aag atg tat gag gtt 1824
Arg Tyr Gly Leu Ser Gly Lys Glu Val Gln Trp Lys Met Tyr Glu Val
595 600 605

tat gat gca aaa tca aaa tct gtc agt ctc cca gtt cca gac ttg tgt 1872
Tyr Asp Ala Lys Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys
610 615 620

gca gtc tat gct gtt cag gtg cgc tgt aag agg cta gat gga ctg gga 1920
Ala Val Tyr Ala Val Gln Val Arg Cys Lys Arg Leu Asp Gly Leu Gly
625 630 635 640

tat tgg agt aat tgg agc aat cca gcc tac aca gtt gtc atg gat ata 1968
Tyr Trp Ser Asn Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp Ile
645 650 655

aaa gtt cct atg aga gga cct gaa ttt tgg aga ata att aat gga gat 2016
Lys Val Pro Met Arg Gly Pro Glu Phe Trp Arg Ile Ile Asn Gly Asp
660 665 670

act atg aaa aag gag aaa aat gtc act tta ctt tgg aag ccc ctg atg 2064
Thr Met Lys Lys Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met
675 680 685

aaa aat gac tca ttg tgc agt gtt cag aga tat gtg ata aac cat cat 2112
Lys Asn Asp Ser Leu Cys Ser Val Gln Arg Tyr Val Ile Asn His His
690 695 700



act tcc tgc aat gga aca tgg tca gaa gat gtg gga aat cac acg aaa 2160
Thr Ser Cys Asn Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys
705 710 715 720

ttc act ttc ctg tgg aca gag caa gca cat act gtt acg gtt ctg gcc 2208
Phe Thr Phe Leu Trp Thr Glu Gln Ala His Thr Val Thr Val Leu Ala
725 730 735

atc aat tca att ggt gct tct gtt gca aat ttt aat tta acc ttt tca 2256
Ile Asn Ser Ile Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser
740 745 750

tgg cct atg agc aaa gta aat atc gtg cag tca ctc agt gct tat cct 2304
Trp Pro Met Ser Lys Val Asn Ile Val Gln Ser Leu Ser Ala Tyr Pro
755 760 765

tta aac agc agt tgt gtg att gtt tcc tgg ata cta tca ccc agt [codon illegible] 2352
Leu Asn Ser Ser Cys Val Ile Val Ser Trp Ile Leu Ser Pro Ser Asp
770 775 780

tac aag cta atg tat ttt att att gag tgg aaa aat ctt aat gaa gat 2400
Tyr Lys Leu Met Tyr Phe Ile Ile Glu Trp Lys Asn Leu Asn Glu Asp
785 790 795 800

ggt gaa ata aaa tgg ctt aga atc tct tca tct gtt aag aag tat tat 2448
Gly Glu Ile Lys Trp Leu Arg Ile Ser Ser Ser Val Lys Lys Tyr Tyr
805 810 815

atc cat gat cat ttt atc ccc att gag aag tac cag ttc agt ctt tac 2496
Ile His Asp His Phe Ile Pro Ile Glu Lys Tyr Gln Phe Ser Leu Tyr
820 825 830

cca ata ttt atg gaa gga gtg gga aaa cca aag ata att aat agt ttc 2544
Pro Ile Phe Met Glu Gly Val Gly Lys Pro Lys Ile Ile Asn Ser Phe
835 840 845

act caa gat gat att gaa aaa cac cag agt gat gca ggt tta tat gta 2592
Thr Gln Asp Asp Ile Glu Lys His Gln Ser Asp Ala Gly Leu Tyr Val
850 855 860

att gtg cca gta att att tcc tct tcc atc tta ttg ctt gga aca tta 2640
Ile Val Pro Val Ile Ile Ser Ser Ser Ile Leu Leu Leu Gly Thr Leu
865 870 875 880

tta ata tca cac caa aga atg aaa aag cta ttt tgg gaa gat gtt ccg 2688
Leu Ile Ser His Gln Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro
885 890 895



aac ccc aag aat tgt tcc tgg gca caa gga ctt aat ttt cag aag aga
Asn Pro Lys Asn Cys Ser Trp Ala Gln Gly Leu Asn Phe Gln Lys Arg
900 905 910

acg gac att ctg gat cca ccg gct aga gcc acc atg acc agc aag gtg
Thr Asp Ile Leu Asp Pro Pro Ala Arg Ala Thr Met Thr Ser Lys Val
915 920 925

tac gac ccc gag cag agg aag agg atg atc acc ggc ccc cag tgg tgg
Tyr Asp Pro Glu Gln Arg Lys Arg Met Ile Thr Gly Pro Gln Trp Trp
930 935 940

gcc agg tgc aag cag atg aac gtg ctg gac agc ttc atc aac tac tac
Ala Arg Cys Lys Gln Met Asn Val Leu Asp Ser Phe Ile Asn Tyr Tyr
945 950 955 960

gac agc gag aag cac gcc gag aac gcc gtg atc ttc ctg cac ggc aac
Asp Ser Glu Lys His Ala Glu Asn Ala Val Ile Phe Leu His Gly Asn
965 970 975

gcc gct agc agc tac ctg tgg agg cac gtg gtg ccc cac atc gag ccc
Ala Ala Ser Ser Tyr Leu Trp Arg His Val Val Pro His Ile Glu Pro
980 985 990

gtg gcc agg tgc atc atc ccc gat ctg atc ggc atg ggc aag agc ggc
Val Ala Arg Cys Ile Ile Pro Asp Leu Ile Gly Met Gly Lys Ser Gly
995 1000 1005

aag agc ggc aac ggc agc tac agg ctg ctg gac cac tac aag tac ctg
Lys Ser Gly Asn Gly Ser Tyr Arg Leu Leu Asp His Tyr Lys Tyr Leu
1010 1015 1020

acc gcc tgg ttc gag ctc ctg aac ctg ccc aag aag atc atc ttc gtg
Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys Lys Ile Ile Phe Val
1025 1030 1035 1040

ggc cac gac tgg ggc gcc tgc ctg gcc ttc cac tac agc tac gag cac
Gly His Asp Trp Gly Ala Cys Leu Ala Phe His Tyr Ser Tyr Glu His
1045 1050 1055

cag gac aag atc aag gcc atc gtg cac gcc gag agc gtg gtg gac gtg
Gln Asp Lys Ile Lys Ala Ile Val His Ala Glu Ser Val Val Asp Val
1060 1065 1070

atc gag agc tgg gac gag tgg cca gac atc gag gag gac atc gcc ctg
Ile Glu Ser Trp Asp Glu Trp Pro Asp Ile Glu Glu Asp Ile Ala Leu
1075 1080 1085

atc aag agc gag gag ggc gag aag atg gtg ctg gag aac aac ttc ttc 3312
Ile Lys Ser Glu Glu Gly Glu Lys Met Val Leu Glu Asn Asn Phe Phe
1090 1095 1100

gtg gag acc atg ctg ccc agc aag atc atg aga aag ctg gag ccc gag 3360
Val Glu Thr Met Leu Pro Ser Lys Ile Met Arg Lys Leu Glu Pro Glu
1105 1110 1115 1120

gag ttc gcc gcc tac ctg gag ccc ttc aag gag aag ggc gag gtg aga 3408
Glu Phe Ala Ala Tyr Leu Glu Pro Phe Lys Glu Lys Gly Glu Val Arg
1125 1130 1135

aga ccc acc ctg agc tgg ccc aga gag atc ccc ctg gtg aag ggc ggc 3456
Arg Pro Thr Leu Ser Trp Pro Arg Glu Ile Pro Leu Val Lys Gly Gly
1140 1145 1150

aag ccc gac gtg gtg cag atc gtg aga aac tac aac gcc tac ctg aga 3504
Lys Pro Asp Val Val Gln Ile Val Arg Asn Tyr Asn Ala Tyr Leu Arg
1155 1160 1165

gcc agc gac gac ctg ccc aag atg ttc atc gag agc gac ccc ggc ttc 3552
Ala Ser Asp Asp Leu Pro Lys Met Phe Ile Glu Ser Asp Pro Gly Phe
1170 1175 1180

ttc agc aac gcc atc gtg gag ggc gcc aag aag ttc ccc aac acc gag 3600
Phe Ser Asn Ala Ile Val Glu Gly Ala Lys Lys Phe Pro Asn Thr Glu
1185 1190 1195 1200

ttc gtg aag gtg aag ggc ctg cac ttc agc cag gag gac gcc ccc gac 3648
Phe Val Lys Val Lys Gly Leu His Phe Ser Gln Glu Asp Ala Pro Asp
1205 1210 1215

gag atg ggc aag tac atc aag agc ttc gtg gag aga gtg ctg aag aac 3696
Glu Met Gly Lys Tyr Ile Lys Ser Phe Val Glu Arg Val Leu Lys Asn
1220 1225 1230

gag cag taa 3705
Glu Gln
1235



<210> 12 

<211> 1234 

<212> PRT 

<213> Artificial sequence 

<223> Artificial sequence description: OBR LUC



<400> 12 
Met Val Leu Ala Ser Ser Thr Thr Ser Ile His Thr Met Leu Leu Leu
1 5 10 15

Leu Leu Met Leu Phe His Leu Gly Leu Gln Ala Ser Ile Ser Ala Arg
20 25 30

Gln Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Thr Arg Tyr Pro Ile
35 40 45

Thr Pro Trp Arg Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr
50 55 60

Asp Tyr Phe Leu Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser
65 70 75 80

Asn Gly His Tyr Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly
85 90 95

Thr His Phe Ser Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg
100 105 110

Ser Glu Gln Asp Arg Asn Cys Ser Leu Cys Ala Asp Asn Ile Glu Gly
115 120 125

Thr Thr Phe Val Ser Thr Val Asn Ser Leu Val Phe Gln Gln Ile Asp
130 135 140

Ala Asn Trp Asn Ile Gln Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe
145 150 155 160

Ile Cys Tyr Val Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn
165 170 175

Tyr Lys Val His Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser
180 185 190

Pro Leu Val Pro Gln Lys Gly Ser Phe Gln Met Val His Cys Asn Cys
195 200 205

Ser Val His Glu Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys
210 215 220

Leu Asn Asp Thr Leu Leu Met Cys Leu Lys Ile Thr Ser Gly Gly Val
225 230 235 240

Ile Phe Arg Ser Pro Leu Met Ser Val Gln Pro Ile Asn Met Val Lys
245 250 255



Pro Asp Pro Pro Leu Gly Leu His Met Glu Ile Thr Asp Asp Gly Asn
260 265 270

Leu Lys Ile Ser Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gln
275 280 285

Tyr Gln Val Lys Tyr Ser Glu Asn Ser Thr Thr Val Ile Arg Glu Ala
290 295 300

Asp Lys Ile Val Ser Ala Thr Ser Leu Leu Val Asp Ser Ile Leu Pro
305 310 315 320

Gly Ser Ser Tyr Glu Val Gln Val Arg Gly Lys Arg Leu Asp Gly Pro
325 330 335

Gly Ile Trp Ser Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gln Asp
340 345 350

Val Ile Tyr Phe Pro Pro Lys Ile Leu Thr Ser Val Gly Ser Asn Val
355 360 365

Ser Phe His Cys Ile Tyr Lys Lys Glu Asn Lys Ile Val Pro Ser Lys
370 375 380

Glu Ile Val Trp Trp Met Asn Leu Ala Glu Lys Ile Pro Gln Ser Gln
385 390 395 400

Tyr Asp Val Val Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu
405 410 415

Asn Glu Thr Lys Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys
420 425 430

Cys Asn Glu His Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val Ile
435 440 445

Asp Val Asn Ile Asn Ile Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys
450 455 460

Met Thr Cys Arg Trp Ser Thr Ser Thr Ile Gln Ser Leu Ala Glu Ser
465 470 475 480

Thr Leu Gln Leu Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp Ile
485 490 495

Pro Ser Ile His Pro Ile Ser Glu Pro Lys Asp Cys Tyr Leu Gln Ser
500 505 510



Asp Gly Phe Tyr Glu Cys Ile Phe Gln Pro Ile Phe Leu Leu Ser Gly
515 520 525

Tyr Thr Met Trp Ile Arg Ile Asn His Ser Leu Gly Ser Leu Asp Ser
530 535 540

Pro Pro Thr Cys Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro
545 550 555 560

Ser Ser Val Lys Ala Glu Ile Thr Ile Asn Ile Gly Leu Leu Lys Ile
565 570 575

Ser Trp Glu Lys Pro Val Phe Pro Glu Asn Asn Leu Gln Phe Gln Ile
580 585 590

Arg Tyr Gly Leu Ser Gly Lys Glu Val Gln Trp Lys Met Tyr Glu Val
595 600 605

Tyr Asp Ala Lys Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys
610 615 620

Ala Val Tyr Ala Val Gln Val Arg Cys Lys Arg Leu Asp Gly Leu Gly
625 630 635 640

Tyr Trp Ser Asn Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp Ile
645 650 655

Lys Val Pro Met Arg Gly Pro Glu Phe Trp Arg Ile Ile Asn Gly Asp
660 665 670

Thr Met Lys Lys Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met
675 680 685

Lys Asn Asp Ser Leu Cys Ser Val Gln Arg Tyr Val Ile Asn His His
690 695 700

Thr Ser Cys Asn Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys
705 710 715 720

Phe Thr Phe Leu Trp Thr Glu Gln Ala His Thr Val Thr Val Leu Ala
725 730 735

Ile Asn Ser Ile Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser
740 745 750

Trp Pro Met Ser Lys Val Asn Ile Val Gln Ser Leu Ser Ala Tyr Pro
755 760 765



Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys Lys Ile Ile Phe Val
1025 1030 1035 1040

Gly His Asp Trp Gly Ala Cys Leu Ala Phe His Tyr Ser Tyr Glu His
1045 1050 1055

Gln Asp Lys Ile Lys Ala Ile Val His Ala Glu Ser Val Val Asp Val
1060 1065 1070

Ile Glu Ser Trp Asp Glu Trp Pro Asp Ile Glu Glu Asp Ile Ala Leu
1075 1080 1085

Ile Lys Ser Glu Glu Gly Glu Lys Met Val Leu Glu Asn Asn Phe Phe
1090 1095 1100

Val Glu Thr Met Leu Pro Ser Lys Ile Met Arg Lys Leu Glu Pro Glu
1105 1110 1115 1120

Glu Phe Ala Ala Tyr Leu Glu Pro Phe Lys Glu Lys Gly Glu Val Arg
1125 1130 1135

Arg Pro Thr Leu Ser Trp Pro Arg Glu Ile Pro Leu Val Lys Gly Gly
1140 1145 1150

Lys Pro Asp Val Val Gln Ile Val Arg Asn Tyr Asn Ala Tyr Leu Arg
1155 1160 1165

Ala Ser Asp Asp Leu Pro Lys Met Phe Ile Glu Ser Asp Pro Gly Phe
1170 1175 1180

Phe Ser Asn Ala Ile Val Glu Gly Ala Lys Lys Phe Pro Asn Thr Glu
1185 1190 1195 1200

Phe Val Lys Val Lys Gly Leu His Phe Ser Gln Glu Asp Ala Pro Asp
1205 1210 1215

Glu Met Gly Lys Tyr Ile Lys Ser Phe Val Glu Arg Val Leu Lys Asn
1220 1225 1230

Glu Gln



<210> 13 

<211> 3486 

<212> DNA 

<213> Artificial sequence 



atc tgt tat gtg gag tca tta ttt aag aat cta ttc agg aat tat aac 528
Ile Cys Tyr Val Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn
165 170 175

tat aag gtc cat ctt tta tat gtt ctg cct gaa gtg tta gaa gat tca 576
Tyr Lys Val His Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser
180 185 190

cct ctg gtt ccc caa aaa ggc agt ttt cag atg gtt cac tgc aat tgc 624
Pro Leu Val Pro Gln Lys Gly Ser Phe Gln Met Val His Cys Asn Cys
195 200 205

agt gtt cat gaa tgt tgt gaa tgt ctt gtg cct gtg cca aca gcc aaa 672
Ser Val His Glu Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys
210 215 220

ctc aac gac act ctc ctt atg tgt ttg aaa atc aca tct ggt gga gta 720
Leu Asn Asp Thr Leu Leu Met Cys Leu Lys Ile Thr Ser Gly Gly Val
225 230 235 240

att ttc cgg tca cct cta atg tca gtt cag ccc ata aat atg gtg aag 768
Ile Phe Arg Ser Pro Leu Met Ser Val Gln Pro Ile Asn Met Val Lys
245 250 255

cct gat cca cca tta ggt ttg cat atg gaa atc aca gat gat ggt aat 816
Pro Asp Pro Pro Leu Gly Leu His Met Glu Ile Thr Asp Asp Gly Asn
260 265 270

tta aag att tct tgg tcc agc cca cca ttg gta cca ttt cca ctt caa 864
Leu Lys Ile Ser Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gln
275 280 285

tat caa gtg aaa tat tca gag aat tct aca aca gtt atc aga gaa gct 912
Tyr Gln Val Lys Tyr Ser Glu Asn Ser Thr Thr Val Ile Arg Glu Ala
290 295 300

gac aag att gtc tca gct aca tcc ctg cta gta gac agt ata ctt cct 960
Asp Lys Ile Val Ser Ala Thr Ser Leu Leu Val Asp Ser Ile Leu Pro
305 310 315 320

ggg tct tcg tat gag gtt cag gtg agg ggc aag aga ctg gat ggc cca 1008
Gly Ser Ser Tyr Glu Val Gln Val Arg Gly Lys Arg Leu Asp Gly Pro
325 330 335

gga atc tgg agt gac tgg agt act cct cgt gtc ttt acc aca caa gat 1056
Gly Ile Trp Ser Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gln Asp
340 345 350



gtc ata tac ttt cca cct aaa att ctg aca agt gtt ggg tct aat gtt
Val Ile Tyr Phe Pro Pro Lys Ile Leu Thr Ser Val Gly Ser Asn Val
355 360 365

tct ttt cac tgc atc tat aag aag gaa aac aag att gtt ccc tca aaa
Ser Phe His Cys Ile Tyr Lys Lys Glu Asn Lys Ile Val Pro Ser Lys
370 375 380

gag att gtt tgg tgg atg aat tta gct gag aaa att cct caa agc cag
Glu Ile Val Trp Trp Met Asn Leu Ala Glu Lys Ile Pro Gln Ser Gln
385 390 395 400

tat gat gtt gtg agt gat cat gtt agc aaa gtt act ttt ttc aat ctg
Tyr Asp Val Val Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu
405 410 415

aat gaa acc aaa cct cga gga aag ttt acc tat gat gca gtg tac tgc
Asn Glu Thr Lys Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys
420 425 430

tgc aat gaa cat gaa tgc cat cat cgc tat gct gaa tta tat gtg att
Cys Asn Glu His Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val Ile
435 440 445

gat gtc aat atc aat atc tca tgt gaa act gat ggg tac tta act aaa
Asp Val Asn Ile Asn Ile Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys
450 455 460

atg act tgc aga tgg tca acc agt aca atc cag tca ctt gcg gaa agc
Met Thr Cys Arg Trp Ser Thr Ser Thr Ile Gln Ser Leu Ala Glu Ser
465 470 475 480

act ttg caa ttg agg tat cat agg agc agc ctt tac tgt tct gat att
Thr Leu Gln Leu Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp Ile
485 490 495

cca tct att cat ccc ata tct gag ccc aaa gat tgc tat ttg cag agt
Pro Ser Ile His Pro Ile Ser Glu Pro Lys Asp Cys Tyr Leu Gln Ser
500 505 510

gat ggt ttt tat gaa tgc att ttc cag cca atc ttc cta tta tct ggc
Asp Gly Phe Tyr Glu Cys Ile Phe Gln Pro Ile Phe Leu Leu Ser Gly
515 520 525

tac aca atg tgg att agg atc aat cac tct cta ggt tca ctt gac tct
Tyr Thr Met Trp Ile Arg Ile Asn His Ser Leu Gly Ser Leu Asp Ser
530 535 540



cca cca aca tgt gtc ctt cct gat tct gtg gtg aag cca ctg cct cca 1680
Pro Pro Thr Cys Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro
545 550 555 560

tcc agt gtg aaa gca gaa att act ata aac att gga tta ttg aaa ata 1728
Ser Ser Val Lys Ala Glu Ile Thr Ile Asn Ile Gly Leu Leu Lys Ile
565 570 575

tct tgg gaa aag cca gtc ttt cca gag aat aac ctt caa ttc cag att 1776
Ser Trp Glu Lys Pro Val Phe Pro Glu Asn Asn Leu Gln Phe Gln Ile
580 585 590

cgc tat ggt tta agt gga aaa gaa gta caa tgg aag atg tat gag gtt 1824
Arg Tyr Gly Leu Ser Gly Lys Glu Val Gln Trp Lys Met Tyr Glu Val
595 600 605

tat gat gca aaa tca aaa tct gtc agt ctc cca gtt cca gac ttg tgt 1872
Tyr Asp Ala Lys Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys
610 615 620

gca gtc tat gct gtt cag gtg cgc tgt aag agg cta gat gga ctg gga 1920
Ala Val Tyr Ala Val Gln Val Arg Cys Lys Arg Leu Asp Gly Leu Gly
625 630 635 640

tat tgg agt aat tgg agc aat cca gcc tac aca gtt gtc atg gat ata 1968
Tyr Trp Ser Asn Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp Ile
645 650 655

aaa gtt cct atg aga gga cct gaa ttt tgg aga ata att aat gga gat 2016
Lys Val Pro Met Arg Gly Pro Glu Phe Trp Arg Ile Ile Asn Gly Asp
660 665 670

act atg aaa aag gag aaa aat gtc act tta ctt tgg aag ccc ctg atg 2064
Thr Met Lys Lys Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met
675 680 685

aaa aat gac tca ttg tgc agt gtt cag aga tat gtg ata aac cat cat 2112
Lys Asn Asp Ser Leu Cys Ser Val Gln Arg Tyr Val Ile Asn His His
690 695 700

act tcc tgc aat gga aca tgg tca gaa gat gtg gga aat cac acg aaa 2160
Thr Ser Cys Asn Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys
705 710 715 720

ttc act ttc ctg tgg aca gag caa gca cat act gtt acg gtt ctg gcc 2208
Phe Thr Phe Leu Trp Thr Glu Gln Ala His Thr Val Thr Val Leu Ala
725 730 735



atc aat tca att ggt gct tct gtt gca aat ttt aat tta acc ttt tca
Ile Asn Ser Ile Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser
740 745 750

tgg cct atg agc aaa gta aac atc gtg cag tca ctc agt gct tat cct
Trp Pro Met Ser Lys Val Asn Ile Val Gln Ser Leu Ser Ala Tyr Pro
755 760 765

tta aac agc agt tgt gtg att gtt tcc tgg ata cta tca ccc agt gat
Leu Asn Ser Ser Cys Val Ile Val Ser Trp Ile Leu Ser Pro Ser Asp
770 775 780

tac aag cta atg tat ttt att att gag tgg aaa aat ctt aat gaa gat
Tyr Lys Leu Met Tyr Phe Ile Ile Glu Trp Lys Asn Leu Asn Glu Asp
785 790 795 800

ggt gaa ata aaa tgg ctt aga atc tct tca tct gtt aag aag tat tat
Gly Glu Ile Lys Trp Leu Arg Ile Ser Ser Ser Val Lys Lys Tyr Tyr
805 810 815

atc cat gat cat ttt atc ccc att gag aag tac cag ttc agt ctt tac
Ile His Asp His Phe Ile Pro Ile Glu Lys Tyr Gln Phe Ser Leu Tyr
820 825 830

cca ata ttt atg gaa gga gtg gga aaa cca aag ata att aat agt ttc
Pro Ile Phe Met Glu Gly Val Gly Lys Pro Lys Ile Ile Asn Ser Phe
835 840 845

act caa gat gat att gaa aaa cac cag agt gat gca ggt tta tat gta
Thr Gln Asp Asp Ile Glu Lys His Gln Ser Asp Ala Gly Leu Tyr Val
850 855 860

att gtg cca gta att att tcc tct tcc atc tta ttg ctt gga aca tta
Ile Val Pro Val Ile Ile Ser Ser Ser Ile Leu Leu Leu Gly Thr Leu
865 870 875 880

tta ata tca cac caa aga atg aaa aag cta ttt tgg gaa gat gtt ccg
Leu Ile Ser His Gln Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro
885 890 895

aac ccc aag aat tgt tcc tgg gca caa gga ctt aat ttt cag aag aga
Asn Pro Lys Asn Cys Ser Trp Ala Gln Gly Leu Asn Phe Gln Lys Arg
900 905 910

acg gac att ctg gat cca ccg gtc gcc acc atg gtg agc aag ggc gag
Thr Asp Ile Leu Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu
915 920 925
915 920 925 



gag ctg ttc acc ggg gtg gtg ccc atc ctg gtc gag ctg gac ggc gac 2832
Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp
930 935 940

gta aac ggc cac aag ttc agc gtg tcc ggc gag ggc gag ggc gat gcc 2880
Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala
945 950 955 960

acc tac ggc aag ctg acc ctg aag ttc atc tgc acc acc ggc aag ctg 2928
Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu
965 970 975

ccc gtg ccc tgg ccc acc ctc gtg acc acc ttc ggc tac ggc gtg cag 2976
Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Gly Tyr Gly Val Gln
980 985 990

tgc ttc gcc cgc tac ccc gac cac atg cgc cag cac gac ttc ttc aag 3024
Cys Phe Ala Arg Tyr Pro Asp His Met Arg Gln His Asp Phe Phe Lys
995 1000 1005

tcc gcc atg ccc gaa ggc tac gtc cag gag cgc acc atc ttc ttc aag 3072
Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys
1010 1015 1020

gac gac ggc aac tac aag acc cgc gcc gag gtg aag ttc gag ggc gac 3120
Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp
1025 1030 1035 1040

acc ctg gtg aac cgc atc gag ctg aag ggc atc gac ttc aag gag gac 3168
Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp
1045 1050 1055

ggc aac atc ctg ggg cac aag ctg gag tac aac tac aac agc cac aac 3216
Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn
1060 1065 1070

gtc tat atc atg gcc gac aag cag aag aac ggc atc aag gtg aac ttc 3264
Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe
1075 1080 1085

aag atc cgc cac aac atc gag gac ggc agc gtg cag ctc gcc gac cac 3312
Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His
1090 1095 1100

tac cag cag aac acc ccc atc ggc gac ggc ccc gtg ctg ctg ccc gac 3360
Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp
1105 1110 1115 1120

aac cac tac ctg agc tac cag tcc gcc ctg agc aaa gac ccc aac gag
Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu
1125 1130 1135

aag cgc gat cac atg gtc ctg ctg gag ttc gtg acc gcc gcc ggg atc
Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile
1140 1145 1150

act ctc ggc atg gac gag ctg tac aag taa
Thr Leu Gly Met Asp Glu Leu Tyr Lys
1155 1160



<210> 14 

<211> 1161

<212> PRT 

<213> Artificial sequence 

<223> Artificial sequence description: OBR YFP

<400> 14 

Met Val Leu Ala Ser Ser Thr Thr Ser Ile His Thr Met Leu Leu Leu
1 5 10 15

Leu Leu Met Leu Phe His Leu Gly Leu Gln Ala Ser Ile Ser Ala Arg
20 25 30

Gln Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Thr Arg Tyr Pro Ile
35 40 45

Thr Pro Trp Arg Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr
50 55 60

Asp Tyr Phe Leu Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser
65 70 75 80

Asn Gly His Tyr Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly
85 90 95

Thr His Phe Ser Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg
100 105 110

Ser Glu Gln Asp Arg Asn Cys Ser Leu Cys Ala Asp Asn Ile Glu Gly
115 120 125

Thr Thr Phe Val Ser Thr Val Asn Ser Leu Val Phe Gln Gln Ile Asp
130 135 140

Ala Asn Trp Asn Ile Gln Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe
145 150 155 160

Ile Cys Tyr Val Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn
165 170 175

Tyr Lys Val His Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser
180 185 190

Pro Leu Val Pro Gln Lys Gly Ser Phe Gln Met Val His Cys Asn Cys
195 200 205

Ser Val His Glu Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys
210 215 220

Leu Asn Asp Thr Leu Leu Met Cys Leu Lys Ile Thr Ser Gly Gly Val
225 230 235 240

Ile Phe Arg Ser Pro Leu Met Ser Val Gln Pro Ile Asn Met Val Lys
245 250 255

Pro Asp Pro Pro Leu Gly Leu His Met Glu Ile Thr Asp Asp Gly Asn
260 265 270

Leu Lys Ile Ser Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gln
275 280 285

Tyr Gln Val Lys Tyr Ser Glu Asn Ser Thr Thr Val Ile Arg Glu Ala
290 295 300

Asp Lys Ile Val Ser Ala Thr Ser Leu Leu Val Asp Ser Ile Leu Pro
305 310 315 320

Gly Ser Ser Tyr Glu Val Gln Val Arg Gly Lys Arg Leu Asp Gly Pro
325 330 335

Gly Ile Trp Ser Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gln Asp
340 345 350

Val Ile Tyr Phe Pro Pro Lys Ile Leu Thr Ser Val Gly Ser Asn Val
355 360 365

Ser Phe His Cys Ile Tyr Lys Lys Glu Asn Lys Ile Val Pro Ser Lys
370 375 380

Glu Ile Val Trp Trp Met Asn Leu Ala Glu Lys Ile Pro Gln Ser Gln
385 390 395 400

Tyr Asp Val Val Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu
405 410 415

Asn Glu Thr Lys Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys
420 425 430

Cys Asn Glu His Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val Ile
435 440 445

Asp Val Asn Ile Asn Ile Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys
450 455 460

Met Thr Cys Arg Trp Ser Thr Ser Thr Ile Gln Ser Leu Ala Glu Ser
465 470 475 480

Thr Leu Gln Leu Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp Ile
485 490 495

Pro Ser Ile His Pro Ile Ser Glu Pro Lys Asp Cys Tyr Leu Gln Ser
500 505 510

Asp Gly Phe Tyr Glu Cys Ile Phe Gln Pro Ile Phe Leu Leu Ser Gly
515 520 525

Tyr Thr Met Trp Ile Arg Ile Asn His Ser Leu Gly Ser Leu Asp Ser
530 535 540

Pro Pro Thr Cys Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro
545 550 555 560

Ser Ser Val Lys Ala Glu Ile Thr Ile Asn Ile Gly Leu Leu Lys Ile
565 570 575

Ser Trp Glu Lys Pro Val Phe Pro Glu Asn Asn Leu Gln Phe Gln Ile
580 585 590

Arg Tyr Gly Leu Ser Gly Lys Glu Val Gln Trp Lys Met Tyr Glu Val
595 600 605

Tyr Asp Ala Lys Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys
610 615 620

Ala Val Tyr Ala Val Gln Val Arg Cys Lys Arg Leu Asp Gly Leu Gly
625 630 635 640

Tyr Trp Ser Asn Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp Ile
645 650 655



Lys Val Pro Met Arg Gly Pro Glu Phe Trp Arg Ile Ile Asn Gly Asp
660 665 670

Thr Met Lys Lys Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met
675 680 685

Lys Asn Asp Ser Leu Cys Ser Val Gln Arg Tyr Val Ile Asn His His
690 695 700

Thr Ser Cys Asn Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys
705 710 715 720

Phe Thr Phe Leu Trp Thr Glu Gln Ala His Thr Val Thr Val Leu Ala
725 730 735

Ile Asn Ser Ile Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser
740 745 750

Trp Pro Met Ser Lys Val Asn Ile Val Gln Ser Leu Ser Ala Tyr Pro
755 760 765

Leu Asn Ser Ser Cys Val Ile Val Ser Trp Ile Leu Ser Pro Ser Asp
770 775 780

Tyr Lys Leu Met Tyr Phe Ile Ile Glu Trp Lys Asn Leu Asn Glu Asp
785 790 795 800

Gly Glu Ile Lys Trp Leu Arg Ile Ser Ser Ser Val Lys Lys Tyr Tyr
805 810 815

Ile His Asp His Phe Ile Pro Ile Glu Lys Tyr Gln Phe Ser Leu Tyr
820 825 830

Pro Ile Phe Met Glu Gly Val Gly Lys Pro Lys Ile Ile Asn Ser Phe
835 840 845

Thr Gln Asp Asp Ile Glu Lys His Gln Ser Asp Ala Gly Leu Tyr Val
850 855 860

Ile Val Pro Val Ile Ile Ser Ser Ser Ile Leu Leu Leu Gly Thr Leu
865 870 875 880

Leu Ile Ser His Gln Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro
885 890 895

Asn Pro Lys Asn Cys Ser Trp Ala Gln Gly Leu Asn Phe Gln Lys Arg
900 905 910



Thr Asp Ile Leu Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu
915 920 925

Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp
930 935 940

Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala
945 950 955 960

Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu
965 970 975

Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Gly Tyr Gly Val Gln
980 985 990

Cys Phe Ala Arg Tyr Pro Asp His Met Arg Gln His Asp Phe Phe Lys
995 1000 1005

Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys
1010 1015 1020

Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp
1025 1030 1035 1040

Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp
1045 1050 1055

Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn
1060 1065 1070

Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe
1075 1080 1085

Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His
1090 1095 1100

Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp
1105 1110 1115 1120

Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu
1125 1130 1135

Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile
1140 1145 1150

Thr Leu Gly Met Asp Glu Leu Tyr Lys
1155 1160



<210> 16 

<211> 131 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Met Ala Gly Ile Lys Ala Leu Ile Ser Leu Ser Phe Gly Gly Ala Ile
1 5 10 15

Gly Leu Met Phe Leu Met Leu Gly Cys Ala Leu Pro Ile Tyr Asn Lys
20 25 30

Tyr Trp Pro Leu Phe Val Leu Phe Phe Tyr Ile Leu Ser Pro Ile Pro
35 40 45

Tyr Cys Ile Ala Arg Arg Leu Val Asp Asp Thr Asp Ala Met Ser Asn
50 55 60

Ala Cys Lys Glu Leu Ala Ile Phe Leu Thr Thr Gly Ile Val Val Ser
65 70 75 80

Ala Phe Gly Leu Pro Ile Val Phe Ala Arg Ala His Leu Ile Glu Trp
85 90 95

Gly Ala Cys Ala Leu Val Leu Thr Gly Asn Thr Val Ile Phe Ala Thr
100 105 110

Ile Leu Gly Phe Phe Leu Val Phe Gly Ser Asn Asp Asp Phe Ser Trp
115 120 125

Gln Gln Trp
130



<210> 17 

<211> 1359 

<212> DNA 

<213> Artificial sequence 

<220> 

<221> CDS 

<222> (1)..(1359)

<220> 

<223> Artificial sequence description : MY47 LUC 



<400> 17 

atg gca ggc atc aaa gct ttg att agt ttg tcc ttt gga gga gca atc 48
Met Ala Gly Ile Lys Ala Leu Ile Ser Leu Ser Phe Gly Gly Ala Ile
1 5 10 15

gga ctg atg ttt ttg atg ctt gga tgt gcc ctt cca ata tac aac aaa 96
Gly Leu Met Phe Leu Met Leu Gly Cys Ala Leu Pro Ile Tyr Asn Lys
20 25 30

tac tgg ccc ctc ttt gtt cta ttt ttt tac atc ctt tca cct att cca 144
Tyr Trp Pro Leu Phe Val Leu Phe Phe Tyr Ile Leu Ser Pro Ile Pro
35 40 45

tac tgc ata gca aga aga tta gtg gat gat aca gat gct atg agt aac 192
Tyr Cys Ile Ala Arg Arg Leu Val Asp Asp Thr Asp Ala Met Ser Asn
50 55 60

gct tgt aag gaa ctt gcc atc ttt ctt aca acg ggc att gtc gtg tca 240
Ala Cys Lys Glu Leu Ala Ile Phe Leu Thr Thr Gly Ile Val Val Ser
65 70 75 80

gct ttt gga ctc cct att gta ttt gcc aga gca cat ctg att gag tgg 288
Ala Phe Gly Leu Pro Ile Val Phe Ala Arg Ala His Leu Ile Glu Trp
85 90 95

gga gct tgt gca ctt gtt ctc aca gga aac aca gtc atc ttt gca act 336
Gly Ala Cys Ala Leu Val Leu Thr Gly Asn Thr Val Ile Phe Ala Thr
100 105 110

ata cta ggc ttt ttc ttg gtc ttt gga agc aat gac gac ttc agc tgg 384
Ile Leu Gly Phe Phe Leu Val Phe Gly Ser Asn Asp Asp Phe Ser Trp
115 120 125

cag cag tgg cga ccg gtg gat cca ccg gct aga gcc acc atg acc agc 432
Gln Gln Trp Arg Pro Val Asp Pro Pro Ala Arg Ala Thr Met Thr Ser
130 135 140

aag gtg tac gac ccc gag cag agg aag agg atg atc acc ggc ccc cag 480
Lys Val Tyr Asp Pro Glu Gln Arg Lys Arg Met Ile Thr Gly Pro Gln
145 150 155 160

tgg tgg gcc agg tgc aag cag atg aac gtg ctg gac agc ttc atc aac 528
Trp Trp Ala Arg Cys Lys Gln Met Asn Val Leu Asp Ser Phe Ile Asn
165 170 175

tac tac gac agc gag aag cac gcc gag aac gcc gtg atc ttc ctg cac 576
Tyr Tyr Asp Ser Glu Lys His Ala Glu Asn Ala Val Ile Phe Leu His
180 185 190



ggc aac gcc gct agc agc tac ctg tgg agg cac gtg gtg ccc cac atc 624
Gly Asn Ala Ala Ser Ser Tyr Leu Trp Arg His Val Val Pro His Ile
195 200 205

gag ccc gtg gcc agg tgc atc atc ccc gat ctg atc ggc atg ggc aag 672
Glu Pro Val Ala Arg Cys Ile Ile Pro Asp Leu Ile Gly Met Gly Lys
210 215 220

agc ggc aag agc ggc aac ggc agc tac agg ctg ctg gac cac tac aag 720
Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg Leu Leu Asp His Tyr Lys
225 230 235 240

tac ctg acc gcc tgg ttc gag ctc ctg aac ctg ccc aag aag atc atc 768
Tyr Leu Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys Lys Ile Ile
245 250 255

ttc gtg ggc cac gac tgg ggc gcc tgc ctg gcc ttc cac tac agc tac 816
Phe Val Gly His Asp Trp Gly Ala Cys Leu Ala Phe His Tyr Ser Tyr
260 265 270

gag cac cag gac aag atc aag gcc atc gtg cac gcc gag agc gtg gtg 864
Glu His Gln Asp Lys Ile Lys Ala Ile Val His Ala Glu Ser Val Val
275 280 285

gac gtg atc gag agc tgg gac gag tgg cca gac atc gag gag gac atc 912
Asp Val Ile Glu Ser Trp Asp Glu Trp Pro Asp Ile Glu Glu Asp Ile
290 295 300

gcc ctg atc aag agc gag gag ggc gag aag atg gtg ctg gag aac aac 960
Ala Leu Ile Lys Ser Glu Glu Gly Glu Lys Met Val Leu Glu Asn Asn
305 310 315 320

ttc ttc gtg gag acc atg ctg ccc agc aag atc atg aga aag ctg gag 1008
Phe Phe Val Glu Thr Met Leu Pro Ser Lys Ile Met Arg Lys Leu Glu
325 330 335

ccc gag gag ttc gcc gcc tac ctg gag ccc ttc aag gag aag ggc gag 1056
Pro Glu Glu Phe Ala Ala Tyr Leu Glu Pro Phe Lys Glu Lys Gly Glu
340 345 350

gtg aga aga ccc acc ctg agc tgg ccc aga gag atc ccc ctg gtg aag 1104
Val Arg Arg Pro Thr Leu Ser Trp Pro Arg Glu Ile Pro Leu Val Lys
355 360 365

ggc ggc aag ccc gac gtg gtg cag atc gtg aga aac tac aac gcc tac 1152
Gly Gly Lys Pro Asp Val Val Gln Ile Val Arg Asn Tyr Asn Ala Tyr
370 375 380



ctg aga gcc agc gac gac ctg ccc aag atg ttc atc gag agc gac ccc 1200
Leu Arg Ala Ser Asp Asp Leu Pro Lys Met Phe Ile Glu Ser Asp Pro
385 390 395 400

ggc ttc ttc agc aac gcc atc gtg gag ggc gcc aag aag ttc ccc aac 1248
Gly Phe Phe Ser Asn Ala Ile Val Glu Gly Ala Lys Lys Phe Pro Asn
405 410 415

acc gag ttc gtg aag gtg aag ggc ctg cac ttc agc cag gag gac gcc 1296
Thr Glu Phe Val Lys Val Lys Gly Leu His Phe Ser Gln Glu Asp Ala
420 425 430

ccc gac gag atg ggc aag tac atc aag agc ttc gtg gag aga gtg ctg 1344
Pro Asp Glu Met Gly Lys Tyr Ile Lys Ser Phe Val Glu Arg Val Leu
435 440 445

aag aac gag cag taa
Lys Asn Glu Gln
450



<210> 18

<211> 452

<212> PRT

<213> Artificial sequence

<223> Artificial sequence description: MY47 LUC

<400> 18 

Met Ala Gly Ile Lys Ala Leu Ile Ser Leu Ser Phe Gly Gly Ala Ile
1 5 10 15

Gly Leu Met Phe Leu Met Leu Gly Cys Ala Leu Pro Ile Tyr Asn Lys
20 25 30

Tyr Trp Pro Leu Phe Val Leu Phe Phe Tyr Ile Leu Ser Pro Ile Pro
35 40 45

Tyr Cys Ile Ala Arg Arg Leu Val Asp Asp Thr Asp Ala Met Ser Asn
50 55 60

Ala Cys Lys Glu Leu Ala Ile Phe Leu Thr Thr Gly Ile Val Val Ser
65 70 75 80

Ala Phe Gly Leu Pro Ile Val Phe Ala Arg Ala His Leu Ile Glu Trp
85 90 95

Gly Ala Cys Ala Leu Val Leu Thr Gly Asn Thr Val Ile Phe Ala Thr
100 105 110

Ile Leu Gly Phe Phe Leu Val Phe Gly Ser Asn Asp Asp Phe Ser Trp
115 120 125

Gln Gln Trp Arg Pro Val Asp Pro Pro Ala Arg Ala Thr Met Thr Ser
130 135 140

Lys Val Tyr Asp Pro Glu Gln Arg Lys Arg Met Ile Thr Gly Pro Gln
145 150 155 160

Trp Trp Ala Arg Cys Lys Gln Met Asn Val Leu Asp Ser Phe Ile Asn
165 170 175

Tyr Tyr Asp Ser Glu Lys His Ala Glu Asn Ala Val Ile Phe Leu His
180 185 190

Gly Asn Ala Ala Ser Ser Tyr Leu Trp Arg His Val Val Pro His Ile
195 200 205

Glu Pro Val Ala Arg Cys Ile Ile Pro Asp Leu Ile Gly Met Gly Lys
210 215 220

Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg Leu Leu Asp His Tyr Lys
225 230 235 240

Tyr Leu Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys Lys Ile Ile
245 250 255

Phe Val Gly His Asp Trp Gly Ala Cys Leu Ala Phe His Tyr Ser Tyr
260 265 270

Glu His Gln Asp Lys Ile Lys Ala Ile Val His Ala Glu Ser Val Val
275 280 285

Asp Val Ile Glu Ser Trp Asp Glu Trp Pro Asp Ile Glu Glu Asp Ile
290 295 300

Ala Leu Ile Lys Ser Glu Glu Gly Glu Lys Met Val Leu Glu Asn Asn
305 310 315 320

Phe Phe Val Glu Thr Met Leu Pro Ser Lys Ile Met Arg Lys Leu Glu
325 330 335

Pro Glu Glu Phe Ala Ala Tyr Leu Glu Pro Phe Lys Glu Lys Gly Glu
340 345 350

Val Arg Arg Pro Thr Leu Ser Trp Pro Arg Glu Ile Pro Leu Val Lys



Tyr Cys Ile Ala Arg Arg Leu Val Asp Asp Thr Asp Ala Met Ser Asn
50 55 60

gct tgt aag gaa ctt gcc atc ttt ctt aca acg ggc att gtc gtg tca 240
Ala Cys Lys Glu Leu Ala Ile Phe Leu Thr Thr Gly Ile Val Val Ser
65 70 75 80

gct ttt gga ctc cct att gta ttt gcc aga gca cat ctg att gag tgg 288
Ala Phe Gly Leu Pro Ile Val Phe Ala Arg Ala His Leu Ile Glu Trp
85 90 95

gga gct tgt gca ctt gtt ctc aca gga aac aca gtc atc ttt gca act 336
Gly Ala Cys Ala Leu Val Leu Thr Gly Asn Thr Val Ile Phe Ala Thr
100 105 110

ata cta ggc ttt ttc ttg gtc ttt gga agc aat gac gac ttc agc tgg 384
Ile Leu Gly Phe Phe Leu Val Phe Gly Ser Asn Asp Asp Phe Ser Trp
115 120 125

cag cag tgg cga ccg gtg gat cca ccg gtc gcc acc atg gtg agc aag 432
Gln Gln Trp Arg Pro Val Asp Pro Pro Val Ala Thr Met Val Ser Lys
130 135 140

ggc gag gag ctg ttc acc ggg gtg gtg ccc atc ctg gtc gag ctg gac 480
Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp
145 150 155 160

ggc gac gta aac ggc cac aag ttc agc gtg tcc ggc gag ggc gag ggc 528
Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly
165 170 175

gat gcc acc tac ggc aag ctg acc ctg aag ttc atc tgc acc acc ggc 576
Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly
180 185 190

aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc ttc ggc tac ggc 624
Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Gly Tyr Gly
195 200 205

gtg cag tgc ttc gcc cgc tac ccc gac cac atg cgc cag cac gac ttc 672
Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Arg Gln His Asp Phe
210 215 220

ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag cgc acc atc ttc 720
Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe
225 230 235 240

ttc aag gac gac ggc aac tac aag acc cgc gcc gag gtg aag ttc gag 768
Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu
245 250 255

ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc atc gac ttc aag 816
Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys
260 265 270

gag gac ggc aac atc ctg ggg cac aag ctg gag tac aac tac aac agc 864
Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser
275 280 285

cac aac gtc tat atc atg gcc gac aag cag aag aac ggc atc aag gtg 912
His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val
290 295 300

aac ttc aag atc cgc cac aac atc gag gac ggc agc gtg cag ctc gcc 960
Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala
305 310 315 320

gac cac tac cag cag aac acc ccc atc ggc gac ggc ccc gtg ctg ctg 1008
Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu
325 330 335

ccc gac aac cac tac ctg agc tac cag tcc gcc ctg agc aaa gac ccc 1056
Pro Asp Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu Ser Lys Asp Pro
340 345 350

aac gag aag cgc gat cac atg gtc ctg ctg gag ttc gtg acc gcc gcc 1104
Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala
355 360 365

ggg atc act ctc ggc atg gac gag ctg tac aag taa 1140
Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys
370 375 380



<210> 20 

<211> 379 

<212> PRT 

<213> Artificial sequence 

<223> Artificial sequence description ; MY47 YFP 

<400> 20 

Met Ala Gly Ile Lys Ala Leu Ile Ser Leu Ser Phe Gly Gly Ala Ile
1 5 10 15

Gly Leu Met Phe Leu Met Leu Gly Cys Ala Leu Pro Ile Tyr Asn Lys
20 25 30



Tyr Trp Pro Leu Phe Val Leu Phe Phe Tyr Ile Leu Ser Pro Ile Pro
35 40 45

Tyr Cys Ile Ala Arg Arg Leu Val Asp Asp Thr Asp Ala Met Ser Asn
50 55 60

Ala Cys Lys Glu Leu Ala Ile Phe Leu Thr Thr Gly Ile Val Val Ser
65 70 75 80

Ala Phe Gly Leu Pro Ile Val Phe Ala Arg Ala His Leu Ile Glu Trp
85 90 95

Gly Ala Cys Ala Leu Val Leu Thr Gly Asn Thr Val Ile Phe Ala Thr
100 105 110

Ile Leu Gly Phe Phe Leu Val Phe Gly Ser Asn Asp Asp Phe Ser Trp
115 120 125

Gln Gln Trp Arg Pro Val Asp Pro Pro Val Ala Thr Met Val Ser Lys
130 135 140

Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp
145 150 155 160

Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly
165 170 175

Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly
180 185 190

Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Gly Tyr Gly
195 200 205

Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Arg Gln His Asp Phe
210 215 220

Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe
225 230 235 240

Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu
245 250 255

Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys
260 265 270

Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser
275 280 285



Figure 1 



AS 01: 3'-teg- G*G*G C*C*C*G G*C A*C*C G*T*C*G*T*T*C*G

AS 02: 3'-teg- G*G*G*T*C A A G*C*C*C T*C*T G*T A*C*C*G

AS 03: 3'-teg- G*C*C*C T*C*T G*T A*C*C G*C*C*C G*C*A*A

AS 04: 3'-teg- T*A*G*C G*C*C*C G*C A A*T*T*T*C G A*G*A

AS 05: 3'-teg- T*T*C*G A G A*G*C A*C*C G*T A A*T A*G*G

AS 06: 3'-teg- G*A*A*T A*C G A*C*C*C*T A*C A*G G G*A*A

AS 07: 3'-teg- A*C*A*C G G A A*T*C*T C*C*T*A A*T A*C*C

AS 08: 3'-teg- C*T*C*C*T A A*T A*C*C G*C A A*A*T G*A*G

AS 09: 3'-teg- G*G*G*G*A A A*T G A*C*C*G G G A*A T*A*A

AS 10: 3'-teg- A*C*G G A*C A G*C*G*C T*T*G A*G*G G*T*A

AS 11: 3'-teg- G*G*A*C A G*C*C*C T*T*G A*G*C G*T A*T A*A*A

AS 12: 3'-teg- G*C*G*G T*T*G A*C*C G*T A*T A A*A G*A*A

AS 13: 3'-teg- G*G*A A*G*A*C A A*C*C*G T*G*C*G T*T*A*C

AS 14: 3'-teg- T*G*T A*G A*C G*T G*T A*C G*G*G G*T A*A

AS 15: 3'-teg- G*C*G*T C*C*T G*T C*C*A G*C*G*G G*C*A*A

AS 16: 3'-teg- G*G*A*C*G G A*C*A T*T*G C*A*G G*T C*T A*A*A



*: Thioester

_: 2'-O-Methylation

teg: Triethylene glycol spacer 
Figure 2

[Sequence alignment, positions 1 to approximately 150: OB-RGRP (human), MY47 (human), a C. elegans sequence, a yeast sequence, and the consensus sequence]
Figure 4A




OB-RGRP-YFP - + - - + -
YFP-OB-RGRP - - + - - +
Gpwss-fink - - - + * 



Figure 4B 



IF : Myc 




6Myc-OB-RGRP - + - +
5% SDS - - + +
Figure 5



OB-RGRP 




6Myc-OB-RGRP 2TM - +
6Myc-OB-RGRP 3TM »
6Myc-OB-RGRP 4TM -
Figure 7
Figure 8a

STAT3 reporter gene 



Figure 8b
Figure 10a
Figure 10c
Figure 11
Figure 12